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Preface 


Except in this paragraph and on the title page, all 
instances of “PowerPC AS” in this document should 
be interpreted as referring to the tags inactive subset 
of the full PowerPC AS Architecture unless otherwise 
stated. 


This document defines the additional instructions and 
facilities, beyond those of the PowerPC AS User 
Instruction Set Architecture and PowerPC AS Virtual 
Environment Architecture, that are provided by the 
PowerPC AS Operating Environment Architecture. It 
covers instructions and facilities not available to the 
application programmer, affecting storage control, 
interrupts, and timing facilities. 


Other related documents define the PowerPC AS User 
Instruction Set Architecture, the PowerPC AS Virtual 
Environment Architecture, and PowerPC AS Imple- 
mentation Features. Book |, PowerPC AS User 
Instruction Set Architecture defines the base instruc- 
tion set and related facilities available to the applica- 
tion programmer. Book Il, PowerPC AS Virtual 
Environment Architecture defines the storage model 
and related instructions and facilities available to the 
application programmer, and the Time Base as seen 
by the application programmer. Book IV, PowerPC 
AS Implementation Features defines the implementa- 
tion-dependent aspects of a particular implementa- 
tion. 


As used in this document, the term “PowerPC AS 
Architecture” refers to the instructions and facilities 
described in Books |, Il, and Ill. The description of the 
instantiation of the PowerPC AS Architecture in a 
given implementation includes also the material in 
Book IV for that implementation. 


User Responsibilities 


m Do not make any unauthorized alterations to the 
document (user notes are permitted). 


m Destroy the entire document when it is super- 
seded, obsolete, or no longer needed. 


m Distribute copies of the document or portions of 
the document only to authorized persons. 


m Verify the version prior to use. The version ver- 
ification procedure is described later in this 
preface. 


m Verify completeness prior to use. The last page 
is labeled “Last Page - End of Document”. The 
end of the Table of Contents shows the last page 
number. 


m Report any deviations from these procedures to 
the document owner. 


Next Scheduled Review 

There is no scheduled review. 

Approval Process 

The process used by the Processor Architecture 
Review Board (PARB) to approve or reject changes 
proposed for this architecture is documented at the 
following DFS directory: 
/.../austin.ibm.com/fs/projects/utds/server arch/process 


Approvals 


This version has been approved by the PARB. 


Version Verification 


See the PowerPC AS representative for your company. 


Preface iii 
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1.1 Overview ................ 1 
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1.3 Document Conventions ....... 1 
1.3.1 Definitions and Notation ..... 1 
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1.1 Overview 


Chapter 1 of Book |, PowerPC AS User Instruction Set 
Architecture describes computation modes, compat- 
ibility with the POWER Architecture, document con- 
ventions, a general systems overview, instruction 
formats, and storage addressing. This chapter aug- 
ments that description as necessary for the PowerPC 
AS Operating Environment Architecture. 


1.2 Compatibility with the 
POWER Architecture 


The PowerPC AS Architecture provides binary com- 
patibility for POWER application programs, except as 
described in the appendix entitled “Incompatibilities 
with the POWER Architecture” in Book |, PowerPC AS 
User Instruction Set Architecture. Binary compatibility 
is not necessarily provided for privileged POWER 
instructions. 


1.3 Document Conventions 


The notation and terminology used in Book | apply to 
this Book also, with the following substitutions. 


m For “system alignment error handler” substitute 
“Alignment interrupt”. 


m For “system data storage error handler” substi- 
tute “Data Storage interrupt”, “Data Segment 
interrupt”, or “Data Storage or Data Segment 
interrupt”, as appropriate. 


1.4 General Systems Overview ..... 3 
15 Exceptions ............... 3 
1.6 Synchronization ........... 3 
1.6.1 Context Synchronization ..... 3 
1.6.2 Execution Synchronization .... 4 
1.7 Logical Partitioning (LPAR) ..... 4 


m For “system error handler” substitute “interrupt”. 


m For “system floating-point enabled exception 
error handler” substitute “Floating-Point Enabled 
Exception type Program interrupt”. 


m For “system illegal instruction error handler” sub- 
stitute “Illegal Instruction type Program 
Interrupt”. 


m For “system instruction storage error handler” 
substitute “Instruction Storage interrupt”, 
“Instruction Segment interrupt”, or “Instruction 
Storage or Instruction Segment interrupt”, as 
appropriate. 


m For “system privileged instruction error handler” 
substitute “Privileged Instruction type Program 
interrupt”. 


m For “system service program” substitute “System 
Call interrupt”. 


m For “system trap handler” substitute “Trap type 
Program interrupt”. 


1.3.1 Definitions and Notation 


The definitions and notation given in Book |, PowerPC 
AS User Instruction Set Architecture are augmented 
by the following. 


m A real page is a 4 KB unit of real storage that is 
aligned at a 4 KB boundary. 


m The context of a program is the environment 
(e.g., privilege and relocation) in which the 
program executes. That context is controlled by 
the contents of certain System Registers, such as 
the MSR and SDR1, and of the address trans- 
lation tables. 
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An exception is an error, unusual condition, or 
external signal, that may set a status bit and may 
or may not cause an interrupt, depending upon 
whether the corresponding interrupt is enabled. 


An interrupt is the act of changing the machine 
state in response to an exception, as described in 
Chapter 5, “Interrupts” on page 51. 


A trap interrupt is an interrupt that results from 
execution of a Trap instruction. 


Additional exceptions to the rule that the 
processor obeys the sequential execution model, 
beyond those described in the section entitled 
“Instruction Fetching” in Book |, are the following. 


— A System Reset or Machine Check interrupt 
may occur. The determination of whether an 
instruction is required by the sequential exe- 
cution model is not affected by the potential 
occurrence of a System Reset or Machine 
Check interrupt. (The determination is 
affected by the potential occurrence of any 
other kind of interrupt.) 


Engineering Note 


Although External, Decrementer, and 
imprecise interrupts must be considered 
in determining whether an instruction is 
required by the sequential execution 
model, the fact that these interrupts are 
not required to be recognized at any spe- 
cific point in the instruction stream allows 
an implementation to halt instruction dis- 
patching and delay recognition of the 
interrupt until the processor comes into a 
state consistent with the sequential exe- 
cution model. Such an implementation 
need not consider these interrupts in 
determining whether an instruction is 
required by the sequential execution 
model. 


Instruction-caused precise interrupts 
must also be considered in determining 
whether an instruction is required by the 
sequential execution model. However, 
for these it is always possible to predict 
whether they might be caused by any 
given instruction and thus to determine 
whether subsequent instructions are sure 
to be required by the sequential exe- 
cution model. 
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— A context-altering instruction is executed 
(see Chapter 7, “Synchronization Require- 
ments for Special Registers and for Look- 
aside Buffers” on page 71). The context 
alteration need not take effect until the 
required subsequent synchronizing operation 
has occurred. 


m Hardware means any combination of hard-wired 
implementation, emulation assist, or interrupt for 
software assistance. In the last case, the inter- 
rupt may be to an architected location or to an 
implementation-dependent location. Any use of 
emulation assists or interrupts to implement the 
architecture is described in Book IV, PowerPC AS 
Implementation Features. 


a /,//,///,.... denotes a field that is reserved in an 
instruction, in a register, or in an architected 
storage table. 


1.3.2 Reserved Fields 


Some fields of certain storage tables may be written 
to automatically by hardware, e.g., Reference and 
Change bits in the Page Table. When the hardware 
writes to such a table, the following rules are obeyed. 


m Unless otherwise stated, no defined field other 
than the one(s) the hardware is specifically 
updating are modified. 


m Contents of reserved fields are either preserved 
by hardware or written as Os. No other changes 
to reserved fields are made. 


The handling of reserved bits in System Registers 
described in Book | applies here as well. The reader 
should be aware that reading and writing of some of 
these registers (e.g., the MSR) can occur as a side 
effect of processing an interrupt and of returning from 
an interrupt, as well as when requested explicitly by 
the appropriate instruction (e.g., mtmsrd). 


Programming Note 


System software should initialize reserved fields 
in architected storage tables (e.g., the Page Table) 
to Os and not keep data in them, as the fields may 
be assigned a meaning in some future version of 
the architecture. 
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1.4 General Systems Overview 


The processor or processor unit contains the 
sequencing and processing controls for instruction 
fetch, instruction execution, and interrupt action. 
Instructions that the processing unit can execute fall 
into three classes: 


m instructions executed in the Branch Processor 

m instructions executed in the Fixed-Point Processor 

m instructions executed in the Floating-Point 
Processor 


Almost all instructions executed in the Branch 
Processor, Fixed-Point Processor, and Floating-Point 
Processor are nonprivileged and are described in 
Book |, PowerPC AS User Instruction Set Architecture. 
Book Il, PowerPC AS Virtual Environment Architecture 
may describe additional nonprivileged instructions 
(e.g., Book Il describes some nonprivileged 
instructions for cache management). Instructions 
related to the privileged state of the processor, 
control of processor resources, control of the storage 
hierarchy, and all other privileged instructions are 
described here or in Book IV, PowerPC AS Implemen- 
tation Features. 


BRANCH > FIXED- 
POINT o DATA 
PROCESSOR PROCESSOR CACHE 
INSTRUCTION FLOATING- 
CACHE > POINT 
PROCESSOR | O 


MAIN MEMORY 


Í 


DIRECT MEMORY ACCESS 


Figure 1. Logical view of the PowerPC AS processor 
architecture 


1.5 Exceptions 


The following augments the list, given in Book |, of 
exceptions that can be caused directly by the exe- 
cution of an instruction: 


m the execution of a floating-point instruction when 
MSRpp= O (Floating-Point Unavailable interrupt) 


m an attempt to modify a hypervisor resource when 
the processor is in privileged but non-hypervisor 
state (see Section 1.7), or an attempt to execute 
a hypervisor-only instruction (e.g., tibie) when the 
processor is in privileged but non-hypervisor 
state. 


m the execution of a traced instruction (Trace inter- 
rupt) 


1.6 Synchronization 


The synchronization described in this section refers to 
the state of the processor that is performing the syn- 
chronization. 


1.6.1 Context Synchronization 


An instruction or event is context synchronizing if it 
satisfies the requirements listed below. Such 
instructions and events are collectively called context 
synchronizing operations. Examples of context syn- 
chronizing operations include the sc instruction, the 
isync instruction, the rfid instruction, and most inter- 
rupts. 


1. The operation causes instruction dispatching (the 
issuance of instructions by the instruction fetch 
mechanism to any instruction execution mech- 
anism) to be halted. 


2. The operation is not initiated or, in the case of 
isync, does not complete, until all instructions 
already in execution have completed to a point at 
which they have reported all exceptions they will 
cause. 


3. The instructions that precede the operation com- 
plete execution in the context (privilege, relo- 
cation, storage protection, etc.) in which they 
were initiated. 


4. If the operation directly causes an interrupt (e.g., 
sc directly causes a System Call interrupt) or is 
an interrupt, the operation is not initiated until no 
exception exists having higher priority than the 
exception associated with the interrupt (see 
Section 5.8, “Interrupt Priorities” on page 65). 


5. The instructions that follow the operation will be 
fetched and executed in the context established 
by the operation. (This requirement dictates that 
any prefetched instructions be discarded and that 
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any effects and side effects of executing them 
out-of-order also be discarded, except as 
described in Section 4.2.4, “Performing Oper- 
ations Out-of-Order” on page 23.) 


A context synchronizing operation is necessarily exe- 
cution synchronizing; see Section 1.6.2, “Execution 
Synchronization”. Unlike the sync instruction, a 
context synchronizing operation does not affect the 
order in which storage accesses are performed with 
respect to other processors and mechanisms, or the 
order in which Reference and Change bit updates are 
performed. 


1.6.2 Execution Synchronization 


An instruction is execution synchronizing if it satisfies 
items 2 on page 3 and 3 on page 3 of the definition 
of context synchronization (see Section 1.6.1). syncis 
treated like isync with respect to item 2 on page 3 
(i.e., the conditions described in item 2 on page 3 
apply to the completion of sync). Examples of exe- 
cution synchronizing instructions are sync and 
mtmsrd. Also, all context synchronizing instructions 
are execution synchronizing. 


Unlike a context synchronizing operation, an exe- 
cution synchronizing instruction need not ensure that 
the instructions following that instruction will execute 
in the context established by that instruction. This 
new context becomes effective sometime after the 
execution synchronizing instruction completes and 
before or at a subsequent context synchronizing oper- 
ation. 


1.7 Logical Partitioning (LPAR) 


The Logical Partitioning (LPAR) facility permits 
processors and portions of real storage to be 
assigned to logical collections called partitions, such 
that a program executing on a processor in one parti- 
tion cannot interfere with any program executing ona 
processor in a different partition. This isolation can 
be provided for both problem state and privileged 
state programs, by using a layer of trusted software, 
called a hypervisor program (or simply a 
“hypervisor”), and the resources provided by this 
facility to manage system resources. (A hypervisor is 
a program that runs in hypervisor state; see below.) 


The number of partitions supported is implementa- 
tion-dependent. 


A processor is in only one partition at any given time. 
Partitions can be defined without consideration of the 
physical configuration of the system (e.g., shared 
caches, organization of the storage hierarchy). 


A processor may be removed from one partition and 
assigned to a different partition while other 
processors continue to execute programs in their 
respective partitions. The operations necessary to 
assign a processor to a different partition are imple- 
mentation-dependent. 


The following resources are provided to support 
logical partitioning. 


1. HV bit of the MSR 


This bit, along with MSRpp, controls whether the 
processor is in hypervisor state (see Section 2.2.3 
on page 7). 


2. Logical Partitioning Environment Selector (LPES) 
bit 
This bit affects how storage is accessed in real 
addressing mode (see Section 4.2.6 on page 26 
and Section 4.9.2 on page 40) and how the MSR 
is set when an interrupt occurs (see Section 5.5 
on page 54). 


Programming Note 


LPES=0 provides an environment in which 
only the hypervisor can run with address 
translation disabled and in which all inter- 
rupts invoke the hypervisor. This value 
(along with MSRyy=1) can also be used in a 
system that is not partitioned, to permit the 
operating system to access all system 


resources. 


3. Real mode storage access control 


The Real Mode Offset Register (RMOR), Real 
Mode Limit Register (RMLR), and Real Mode 
Caching Inhibited bit control access to storage in 
real addressing mode, as described in Section 
4.2.6 on page 26. 


4. Logical Partition Identity Register (LPIDR) 


This register contains a value that identifies the 
partition to which the processor is assigned. 


With the exception of MSRyy, the format and contents 
of these resources, the conditions that must be estab- 
lished before they are altered, the means provided for 
altering them, and the software synchronization 
required in order to make the alterations effective are 
implementation-dependent. 


With the exception of MSRyy, the resources defined 
above and those in the following list are hypervisor 
resources. 


m All implementation-specific resources, including 
implementation-specific registers (e.g., “HID” reg- 
isters), that control hardware functions or affect 
the results of instruction execution. Examples 
include resources that disable caches, disable 
hardware error detection, set breakpoints, control 
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power management, or significantly affect per- 
formance. 


m ME bit of the MSR 


m SDR1, EAR, SPRGO, Time Base, PIR, DABR (if 
implemented) 


m the large virtual page size, if a means is provided 
by which software can alter it 


The contents of a hypervisor resource can be modi- 
fied by the execution of an instruction (e.9., mispr) 
only in hypervisor state (MSRyy pa = 0b10). Whether 
an attempt to modify the contents of a given 
hypervisor resource, other than MSRye, in privileged 
but non-hypervisor state (MSRyy pa =  0b00) is 
ignored (i.e., treated as a no-op) or causes a Privi- 
leged Instruction type Program interrupt is implemen- 
tation-dependent. An attempt to modify MSRye in 
privileged but non-hypervisor state is ignored (i.e., 
the bit is not changed). 


Engineering Note 


Causing a Privileged Instruction type Program 
interrupt if attempt is made to modify the contents 
of a hypervisor resource in privileged but non- 
hypervisor state facilitates the debugging of soft- 
ware. 


The tlbie and tlbsync instructions can be executed 
only in hypervisor state; see the descriptions of these 
instructions on pages 47 and 48. 


In general, if software violates a rule that is stated in 
the Books using the word “must” (e.g., “this field 
must be set to 0”) the results are boundedly unde- 
fined. The only exception is that if hypervisor soft- 
ware violates such a rule that pertains to the contents 
of a hypervisor resource, to accessing storage in real 
addressing mode, or to using the tlbie and tlbsync 
instructions, the results are undefined, and may 
include altering resources belonging to other parti- 
tions, causing the system to “hang”, etc. 


Programming Note 


Because the SPRs listed above are privileged for 
writing, an attempt to modify the contents of any 
of these SPRs in problem state (MSRpp=1) using 
mtspr causes a Privileged Instruction type 
Program exception, and similarly for MSRye_. 


If the hypervisor sets a breakpoint for an oper- 
ating system program without verifying the 
requested breakpoint conditions, the breakpoint 
could cause an unexpected Data Storage interrupt 
when the hypervisor is executing. 


Architecture Note 


MSRme the SPRs listed above, and the large 
virtual page size are hypervisor resources 
because they must be altered only by hypervisor 
software. Consequences of permitting alteration 
by non-hypervisor software include the following. 


MSRwye: Non-hypervisor software could cause a 
subsequent Machine Check to cause a 
system-wide Checkstop. 


SDR1, large virtual page size: Non-hypervisor 
software could access storage not allo- 
cated to the partition in which it is 
running. 


EAR:  Non-hypervisor software could access 
memory controller resources (corre- 
sponding to values of EARpip) not allo- 
cated to the partition in which it is 
running. 


SPRGO: Non-hypervisor software could cause the 
hypervisor to use invalid data (see the 
intended use of this register described in 
Section 3.3.3). 


Time Base: Non-hypervisor software could cause 
the Time Base on one processor to be 
out of synchronization with that on other 
processors, with the result that the first 
processor's Time Base would have to be 
resynchronized as part of assigning the 
processor to a different partition. 


PIR: Because the PIR is used in communi- 
cation with other processors and with I/O 
devices, non-hypervisor software could 
cause the system to “hang”. 


DABR: Non-hypervisor software could set a 
breakpoint in hypervisor data. 


Engineering Note 


On an implementation that provides a Perform- 
ance Monitor facility (e.g., see Appendix E), any 
Performance Monitor resource having the prop- 
erty that alteration of the resource by a processor 
in one partition could affect the integrity of other 
partitions must be a hypervisor resource. (lt is 
expected that most Performance Monitor 
resources will not have this property.) 


Control bits that are hypervisor resources should 
not be defined in registers or resources that 
contain bits that can be altered by non-hypervisor 
programs. 
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Engineering Note 


The requirements for altering hypervisor 
resources must be such that a processor assigned 
to one partition can be reassigned to a different 
partition without affecting the execution of pro- 
grams on other processors. In addition, 
deterministic means must be provided to perform 
other functions associated with reassigning a 
processor to a different partition, such as invali- 
dating SLB, TLB, and ERAT entries. 


The speed with which this reassignment can be 
performed may affect how the LPAR facility is 
used. Decisions regarding how reassignment is 
accomplished in a given implementation must 
include consideration of the intended uses of the 
facility and of the consequent performance 
requirements. 
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Chapter 2. Branch Processor 


2.1 Branch Processor Overview .... 7 

2.2 Branch Processor Registers .... 7 

2.2.1 Machine Status Save/Restore 
Register 0 ....... nn... 7 


2.2.2 Machine Status Save/Restore 


Register? ................. 7 
2.2.3 Machine State Register ...... 7 
2.3 Branch Processor Instructions ... 11 
2.3.1 System Linkage Instructions ... 11 


2.1 Branch Processor Overview 


This chapter describes the details concerning the reg- 
isters and the privileged instructions implemented in 
the Branch Processor that are not covered in Book |, 
PowerPC AS User Instruction Set Architecture. 


2.2 Branch Processor Registers 


2.2.1 Machine Status Save/Restore 
Register 0 


The Machine Status Save/Restore Register 0 (SRRO) 
is a 64-bit register. This register is used to save 
machine status on interrupts, and to restore machine 
status when an rfid instruction is executed. 


On interrupt, SRRO is set to the current or next 
instruction address. Thus if the interrupt occurs in 
32-bit mode, the high-order 32 bits of SRRO are set to 
O. When rfid is executed, the contents of SRRO are 
copied to the next instruction address (NIA), except 
that the high-order 32 bits of the NIA are set to O 
when returning to 32-bit mode. 


SRRO 
0 61 63 


Figure 2. Save/Restore Register 0 


In general, SRRO contains either the address of the 
instruction that caused the interrupt, or the address of 


the instruction to return to after an interrupt is ser- 
viced. 


2.2.2 Machine Status Save/Restore 
Register 1 


The Machine Status Save/Restore Register 1 (SRR1) 
is a 64-bit register. This register is used to save 
machine status on interrupts, and to restore machine 
status when an rfid instruction is executed. 


| SRR1 


0 63 


Figure 3. Save/Restore Register 1 


In general, when an interrupt occurs, bits 33:36 and 
42:47 of SRR1 are loaded with information specific to 
the interrupt type, and bits 0:32, 37:41, and 48:63 of 
the MSR are placed into the corresponding bit posi- 
tions of SRR1. 


SRR1 bits in the range 0:32, 37:41, and 48:63 may be 
treated as reserved in a given implementation if they 
correspond to MSR bits that are reserved or are 
treated as reserved in that implementation. 


2.2.3 Machine State Register 


The Machine State Register (MSR) is a 64-bit register. 
This register defines the state of the processor. On 
interrupt, the MSR bits are altered in accordance with 
Figure 27 on page 54. The MSR can also be modified 
by the mtmsr{d], sc, and rfid instructions. It can be 
read by the mfmsr instruction. 
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MSR 
0 


Figure 4. Machine State Register 


Below are shown the bit definitions for the Machine 


State Register. 


Architecture Note 


Defined MSR bits are classified as either full func- 
tion or partial function. Full function MSR bits are 
saved in SRR1 when an interrupt occurs and 
restored by rfid, while partial function MSR bits 
are not saved or restored. Full function MSR bits 
lie in the range 0:32, 37:41, and 48:63, and partial 
function MSR bits lie in the range 33:36 and 42:47. 


Reserved MSR bits in the “full function range” 
need not be saved or restored. If they are not 
restored then they must be written as 0; if they 
are not saved then the corresponding SRR1 bits 
must be written as 0. The properties of reserved 
bits in System Registers are such that this alter- 
native behavior does not conflict with the 
descriptions of sc, rfid, and interrupt processing 
elsewhere in this Book. 


Bit Description 
0 Sixty-Four-Bit Mode (SF) 


O The processor is in 32-bit mode. 
1 The processor is in 64-bit mode. 


1:2 Reserved 
Architecture Note 


Bit 2 will be among the last to be 
assigned a meaning. It was the ISF (Inter- 
rupt Sixty-Four Bit Mode) bit in earlier 
versions of the architecture. 


3 Hypervisor State (HV) 


O The processor is not in hypervisor state. 
1 If MSRpp=0 the processor is 


hypervisor state; otherwise the processor 


is not in hypervisor state. 


4:46 


47 


48 


49 
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Programming Note 


The privilege state of the processor is 
determined by MSRyy and MSRpp, as 
follows. 


HV PR 

0 0 privileged 

0 1 problem 

1 0 privileged and hypervisor 
1/1 problem 


MSRyy can be set to 1 only by the System 
Call instruction and some interrupts. It 
can be set to O only by the rfid instruction 
and possibly by some interrupts. 


Reserved 
Architecture Note 


Bit 45 will be among the last to be 
assigned a meaning. It was the POW 
(Power Management) bit in earlier ver- 
sions of the architecture. 


In the POWER Architecture, SRR15 is used 
by the Instruction Storage interrupt to 
indicate a loop in the translation mech- 
anism. Because of this, MSR3> will not be 
assigned a new meaning in the near 
future. 


Bit 46 is used on some implementations 
for an implementation-specific function. 
See the Book IV, PowerPC AS Implemen- 
tation Features document for the imple- 
mentation. (Bit 46 is the Temporary GPR 
Remapping (TGPR) bit on the 603.) 


Interrupt Little-Endian Mode (ILE) 


This bit is part of the optional Little-Endian 
facility; see the section entitled “Little-Endian” 
in Book |. 


If the Little-Endian facility is implemented, 
when an interrupt occurs this bit is copied to 
MSR.g to select the Endian mode for the 
context established by the interrupt. 


If the Little-Endian facility is not implemented, 
this bit is treated as reserved. 


External Interrupt Enable (EE) 


O External and Decrementer interrupts are 
disabled. 

1 External and Decrementer interrupts are 
enabled. 


Problem State (PR) 


O The processor is in privileged state. 
1 The processor is in problem state. 
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50 


51 


52 


53 


54 


55 


Programming Note 


Any instruction or event that sets MSRpa 
to 1 also sets MSR| and MSRp,p to 1. 


Floating-Point Available (FP) 


O The processor cannot execute any float- 
ing-point instructions, including floating- 
point loads, stores, and moves. 

1 The processor can execute floating-point 
instructions. 


Machine Check Enable (ME) 


O Machine Check interrupts are disabled. 
1 Machine Check interrupts are enabled. 


This bit is a hypervisor resource; see Section 
1.7, “Logical Partitioning (LPAR)” on page 4. 


Programming Note 


The only instruction that can alter MSRye 
is the rfid instruction. 


Floating-Point Exception Mode 0 (FEO) 


See below. 

Single-Step Trace Enable (SE) 

O The processor executes instructions 
normally. 


1 The processor generates a Single-Step 
type Trace interrupt after successfully 
completing the execution of the next 
instruction (unless that instruction is rfid, 
which is never traced). Successful com- 
pletion means that the instruction caused 
no other interrupt. 


Branch Trace Enable (BE) 


O The processor 
instructions normally. 

1 The processor generates a Branch type 
Trace interrupt after completing the exe- 
cution of a branch instruction, whether or 
not the branch is taken. See Book IV, 
PowerPC AS Implementation Features. 


executes branch 


Branch tracing may not be present on all 
implementations. If the function is not imple- 
mented, this bit is treated as reserved. 


Floating-Point Exception Mode 1 (FE1) 


See below. 


56:57 Reserved 


58 


59 


60 
61 


62 


63 


Architecture Note 


Bit 56 corresponds to the AL bit of the 
POWER Architecture (see the appendix 
entitled “Incompatibilities with the POWER 
Architecture” in Book |). 


Bit 57 will be among the last to be 
assigned a meaning. It was the IP (Inter- 
rupt Prefix) bit in earlier versions of the 
architecture. 


Instruction Relocate (IR) 


O Instruction address translation is off. 
1 Instruction address translation is on. 


Programming Note 


Any instruction or event that sets MSR,p 
to O also sets MSRpp to 0. 


Data Relocate (DR) 


O Data address translation is off. 
1 Data address translation is on. 


Programming Note 


Any instruction or event that sets MSRpr 
to O also sets MSRpp to 0. 


Reserved 
Performance Monitor Mark (PMM) 


This bit is part of the optional Performance 
Monitor facility; see Appendix E. If the Per- 
formance Monitor facility is not implemented 
or does not use this bit, this bit is treated as 
reserved. 


Recoverable Interrupt (Rl) 


O Interrupt is not recoverable. 
1 Interrupt is recoverable. 


Additional information about the use of this 
bit is given in Sections 5.4, “Interrupt 
Processing” on page 53, 5.5.1, “System Reset 
Interrupt” on page 55, and 5.5.2, “Machine 
Check Interrupt” on page 55. 


Little-Endian Mode (LE) 


This bit is part of the optional Little-Endian 
facility; see the section entitled “Little-Endian” 
in Book |. 


If the Little-Endian facility is implemented, this 
bit has the following meaning. 


O The processor is in Big-Endian mode. 
1 The processor is in Little-Endian mode. 


If the Little-Endian facility is not implemented, 
this bit is treated as reserved. 
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The Floating-Point Exception Mode bits FEO and FE1 Architecture Note 
A pa lu sall isa el The initial state of the MSR should be as follows: 
tecture; Bit Name Value 
FE0 FE1 Mode | 0 SF 1 
0 O Ignore Exceptions 1:2 unspecified 
0 1 Imprecise Nonrecoverable 3 HV 1 
1 0 Imprecise Recoverable 4:46 unspecified 
1 1 Precise 47 ILE 0 
48 EE 0 
49 PR 0 
50 FP 0 
51 ME 0 
52 FEO 0 
53 SE 0 
54 BE 0 
55 FE1 0 
56:57 unspecified 
58 IR 0 
59 DR 0 
60 unspecified 
61 PMM unspecified 
62 RI 0 
63 LE 0 
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2.3 Branch Processor Instructions 


2.3.1 System Linkage Instructions 


These instructions provide the means by which a 
program can call upon the system to perform a 
service, and by which the system can return from per- 
forming a service or from processing an interrupt. 


System Call SC-form 


sc LEV 
[POWER mnemonic: svca] 


17 HH HH II LEV | // |1]/ 
0 6 11 16 20 27 |30/31 


SRRO €., CIA + 4 
SRR133.36 42:47 € 0 


R10:32 37:41 48:63 € MSRo:32 37-41 48:63 
MSR € new value (see below) 


NIA 4 0x0000 0000 0000 0C00 


The effective address of the instruction following the 
System Call instruction is placed into SRRO. Bits 0:32, 
37:41, and 48:63 of the MSR are placed into the corre- 
sponding bits of SRR1, and bits 33:36 and 42:47 of 
SRR1 are set to zero. 


Then a System Call interrupt is generated. The inter- 
rupt causes the MSR to be altered as described in 
Section 5.5, “Interrupt Definitions” on page 54. 


The interrupt causes the next instruction to be fetched 
from effective address 0x0000 0000 0000 0CO00. 


The contents of the LEV field must be O or 1; other- 
wise the results are boundedly undefined. 


This instruction is context synchronizing. 


Special Registers Altered: 
SRRO SRR1 MSR 


The System Call instruction is described in Book |, 
PowerPC AS User Instruction Set Architecture, but 
only at the level required by an application pro- 
grammer. A complete description of this instruction 
appears below. 


Programming Note 


sc serves as both a basic and an extended mne- 
monic. The Assembler will recognize an sc mne- 
monic with one operand as the basic form, and an 
sc mnemonic with no operand as the extended 
form. In the extended form the LEV operand is 
omitted and assumed to be 0. 


Programming Note 
If LEV=1 the hypervisor is invoked. 
If LPES=1, executing this instruction with LEV=1 


is the only way that executing an instruction can 
cause hypervisor state to be entered. 


Because this instruction is not privileged, it is 
possible for application software to invoke the 


hypervisor. However, such invocation should be 
considered a programming error. 


Engineering Note ——————————————__ 
LEVo.5 must be ignored by the processor. 


Architecture Note 


The requirement that LEV,.5 contain zeros and be 
ignored by the processor permits these bits to be 
assigned a meaning in the future if that proves 
desirable. 


Compatibility Note 


For a discussion of POWER compatibility with 
respect to instruction bits 16:19 and 27:29, see the 
appendix entitled “Incompatibilities with the 
POWER Architecture” in Book I, PowerPC AS User 
Instruction Set Architecture. For compatibility 


with future versions of the PowerPC AS Architec- 
ture, these bits should be coded as zeros. 
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Return From Interrupt Doubleword 
XL-form 


rfid 

19 VI] HH VI] 18 / 
0 6 11 16 21 31 
MSR51 € (MSRg & SRR1s4) | ((7MSRg) & MSR54) 


MSR3 ¢MSRg & SRR13 
MSReg € SRR1sg | SRR14g 
MSRsg € SRRl5g | SRR14g 


MSR0:2 4:32 37:41 48:50 52:57 60:63 € SRR1 9:2 4:32 37:41 48:50 52:57 60:63 
NIA €.. SRR00-64 


If MSR3=1 then bits 3 and 51 of SRR1 are placed into 
the corresponding bits of the MSR. The result of 
ORing bits 58 and 49 of SRR1 is placed into MSRsg. 
The result of ORing bits 59 and 49 of SRR1 is placed 
into MSRso. Bits 0:2, 4:32, 37:41, 48:50, 52:57, and 
60:63 of SRR1 are placed into the corresponding bits 
of the MSR. 


If the new MSR value does not enable any pending 
exceptions, then the next instruction is fetched, under 
control of the new MSR value, from the address 
SRROy6 | 0b00 (when SF=1 in the new MSR value) 
or 320 | SRRO32:54 | 0b00 (when SF=0 in the new 
MSR value). If the new MSR value enables one or 
more pending exceptions, the interrupt associated 
with the highest priority pending exception is gener- 
ated; in this case the value placed into SRRO by the 
interrupt processing mechanism (see Section 5.4, 
“Interrupt Processing” on page 53) is the address of 
the instruction that would have been executed next 
had the interrupt not occurred. 


This instruction is privileged and context synchro- 
nizing. 


Special Registers Altered: 
MSR 


Programming Note 


If this instruction sets MSRpp to 1, it also sets 
MSR;p and MSRp,p to 1. 


This instruction is the only instruction that can be 
used to set MSRyy to O on all implementations. 
This instruction is the only instruction that can be 
used to alter MSRye. These bits can be altered 
by this instruction only if it is executed in 
hypervisor state. 
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Chapter 3. Fixed-Point Processor 


3.1 Fixed-Point Processor Overview .. 13 
3.2 Special Purpose Registers ..... 13 
3.3 Fixed-Point Processor Registers .. 13 
3.3.1 Data Address Register ...... 13 
3.3.2 Data Storage Interrupt Status 
Register. “a artes ota bake Foe asthe ke dE 14 
3.3.3 Software-Use SPRS ........ 14 


3.1 Fixed-Point Processor 
Overview 


This chapter describes the details concerning the reg- 
isters and the privileged instructions implemented in 
the Fixed-Point Processor that are not covered in 
Book |, PowerPC AS User Instruction Set Architecture. 


3.2 Special Purpose Registers 


Special Purpose Registers (SPRs) are read and 
written using the mfspr (page 18) and mtspr (page 17) 
instructions. Most SPRs are defined in other chapters 
of this book; see the index to locate those definitions. 


3.3.4 Control Register 
3.3.5 Processor Version Register ... 15 
3.3.6 Processor Identification Register 15 
3.4 Fixed-Point Processor Privileged 


Instructions ................ 16 
3.4.1 Move To/From System Register 
Instructions ................ 16 


3.3 Fixed-Point Processor 
Registers 


3.3.1 Data Address Register 


The Data Address Register (DAR) is a 64-bit register 
that is set by Data Storage, Data Segment, and Align- 
ment interrupts. See Section 5.5.3, “Data Storage 
Interrupt” on page 56, Section 5.5.4, “Data Segment 
Interrupt” on page 57, and Section 5.5.8, “Alignment 
Interrupt” on page 59. When one of these interrupts 
occurs, the DAR is set to an effective address associ- 
ated with the storage access caused by the inter- 
rupting instruction. If the interrupt occurs in 32-bit 
mode, the high-order 32 bits of the DAR are set to 0. 


DAR 
0 63 


Figure 5. Data Address Register 
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3.3.2 Data Storage Interrupt Status 
Register 


The Data Storage Interrupt Status Register (DSISR) is 
a 32-bit register that defines the cause of Data 
Storage and Alignment interrupts. See Section 5.5.3, 
“Data Storage Interrupt” on page 56 and Section 
5.5.8, “Alignment Interrupt” on page 59. 


0 31 


Figure 6. Data Storage Interrupt Status Register 


3.3.3 Software-Use SPRs 


SPRGO through SPRG3 are 64-bit registers provided 
for use by privileged software. 


0 63 


Figure 7. Software-use SPRs 


The following list describes the conventional uses of 
SPRGO through SPRGS. 


SPRGO 
Hypervisor software may keep a unique real 
address in this register to identify an area of 
storage reserved for use by the hypervisor first- 
level interrupt handler. This area must be unique 
for each processor in the system. 


SPRGO is a hypervisor resource; see Section 1.7, 
“Logical Partitioning (LPAR)” on page 4. 


SPRG1 
This register may be used as a scratch register by 
the first-level interrupt handler to save the contents 
of a GPR. That GPR then can be loaded from 
SPRGO and used as a base register to save other 
GPRs to storage. 


SPRG2 
This register may be used by the operating system 
as needed. 


SPRG3 
This register may be used by the operating system 
as needed. 


lt is optional whether SPRG3 can be read in 
problem state. On implementations that provide 
this ability, SPRG3 may be used for information, 


such as a “thread-id”, that the operating system 
makes available to application programs. 


m~ Programming Note 


On implementations for which SPRG3 can be 
read in problem state, operating systems must 
ensure that no sensitive data are left in SPRG3 
when a problem state program is dispatched, 
and operating systems for secure systems must 
ensure that SPRG3 cannot be used to imple- 
ment a “covert channel” between problem 
state programs. These requirements can be 
satisfied by clearing SPRG3 before passing 
control to a program that will run in problem 
state. 


On such implementations, SPRG3 can be used 
“orthogonally” for both the purpose described 
for it above and the purpose described for 
SPRG1. If this is done, SPRG1 can be used for 
some other purpose. 


m Engineering Note 


The ability to read SPRG3 in problem state is 
being phased into the architecture, and will 
become required in a future version of the 
architecture. 


3.3.4 Control Register 


The Control Register (CTRL) is a 32-bit register that 
controls an external I/O pin. This signal may be used 
for the following: 
m driving the RUN Light on a system operator panel 
m External interrupt routing 
Monitor event 


m Performance counting (see 


Appendix E, “Example Performance Monitor 
(Optional)” on page 93) 
III RUN 
0 31 
Bit Name Description 
31 RUN Run state bit 


All other fields are implementation-dependent. 


Figure 8. Control Register 


The CTRL RUN can be used by the operating system 
to indicate when the processor is doing useful work. 


The contents of the CTRL can be written by the mispr 
instruction and read by the mfspr instruction. Write 
access to the CTRL is privileged. Reads can be per- 
formed in privileged or problem state. 
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3.3.5 Processor Version Register 


The Processor Version Register (PVR) is a 32-bit read- 
only register that contains a value identifying the 
version and revision level of the processor. The con- 
tents of the PVR can be copied to a GPR by the mfspr 
instruction. Read access to the PVR is privileged; 
write access is not provided. 


| Version Revision | 


0 16 31 


Figure 9. Processor Version Register 


The PVR distinguishes between processors that differ 
in attributes that may affect software. It contains two 
fields. 


Version A 16-bit number that identifies the 
version of the processor. Different 
version numbers indicate major differ- 
ences between processors, such as which 
optional facilities and instructions are 
supported. 


Revision A 16-bit number that distinguishes 
between implementations of the version. 
Different revision numbers indicate minor 
differences between processors having 
the same version number, such as clock 
rate and Engineering Change level. 


Version numbers are assigned by the PowerPC AS 
Architecture process. Revision numbers are assigned 
by an implementation-defined process. 


Engineering Note 


Although the classification of a given difference 
between processors as “major” or “minor” is 
somewhat arbitrary, the following are examples of 
differences that generally should be considered 
“major”. 


m number and types of execution units 

m optional facilities and instructions supported 

m level of support of instructions (hard-wired or 
emulated) 

m Size, geometry, and management of caches 
and of TLBs 


The following are examples of differences that 
generally should be considered “minor”. 


m remapping a processor to a new technology 

m redesigning a critical path to increase clock 
rate 

m fixing bugs 


In general, any change to a processor should 
cause a new PVR value to be assigned. Even a 
seemingly trivial change that is not expected to 
be apparent to software should cause a new 
revision number to be assigned, in case the 
change is later discovered to have introduced an 
error that software must circumvent. 


3.3.6 Processor Identification 
Register 


The Processor Identification Register (PIR) is a 32-bit 
register that contains a value that can be used to dis- 
tinguish the processor from other processors in the 
system. The contents of the PIR can be copied to a 
GPR by the mfspr instruction. Read access to the PIR 
is privileged; write access, if provided, is described in 
the Book IV, PowerPC AS Implementation Features 
document for the implementation. 


PROCID 
0 31 


Bits Name Description 
0:31 PROCID Processor ID 


Figure 10. Processor Identification Register 


The means by which the PIR is initialized are imple- 
mentation-dependent (see Book IV). 


The PIR is a hypervisor resource; see Section 1.7, 
“Logical Partitioning (LPAR)” on page 4. 
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3.4 Fixed-Point Processor Privileged Instructions 


3.4.1 Move To/From System Register Instructions 


The Move To Special Purpose Register and Move 
From Special Purpose Register instructions are 
described in Book |, PowerPC AS User Instruction Set 
Architecture, but only at the level available to an 
application programmer. For example, no mention is 
made there of registers that can be accessed only in 
privileged state. The descriptions of these 
instructions given below extend the descriptions given 
in Book I, but do not list Special Purpose Registers 
that are defined in Book IV, PowerPC AS Implementa- 
tion Features. In the descriptions of these instructions 
given below, the “defined” SPR numbers are the SPR 
numbers shown in the figure for the instruction and 
the SPR numbers defined in Book IV for the instruc- 
tion, and similarly for “defined” registers. 


Extended mnemonics 


Extended mnemonics are provided for the mtspr and 
mfspr instructions so that they can be coded with the 
SPR name as part of the mnemonic rather than as a 
numeric operand. See Appendix A, “Assembler 
Extended Mnemonics” on page 83. 


Engineering Note 


SPR numbers that are not shown in Figure 11 or 
Figure 12 and are in the ranges shown below are 
reserved for implementation-specific uses. 


848 - 863 
880 - 895 
976 - 991 
1008 - 1023 


Implementation-specific registers must be privi- 
leged, and must comply with the other guidelines 
and limitations given in the Preface of Book |. 
SPR numbers for implementation-specific regis- 
ters must be registered in advance with the 
person responsible for the technical content of 
this document (see the cover page). 


Architecture Note 


SPR numbers that are in the ranges 28-29, 80-82, 
136-142, 144-159, 276-279, 512-639, and 972-973 
are used in some early implementations for imple- 
mentation-specific purposes. These SPR numbers 
will not be assigned a meaning in the PowerPC AS 
Architecture except after careful consideration of 
the effect of such assignment on existing imple- 
mentations. 
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Move To Special Purpose Register 
XFX-form 


mtspr SPR,RS 


31 RS spr 467 / 


n € sprg9 | spros 

if length (SPREG(n)) = 64 then 
SPREG (n) € (RS) 

else 


The SPR field denotes a Special Purpose Register, 
encoded as shown in Figure 11. The contents of reg- 
ister RS are placed into the designated Special 
Purpose Register. For Special Purpose Registers that 
are 32 bits long, the low-order 32 bits of RS are 
placed into the SPR. 


For this instruction, SPRs TBL and TBU are treated as 
separate 32-bit registers; setting one leaves the other 
unaltered. 


spro=1 if and only if writing the register is privileged. 
Execution of this instruction specifying a defined and 
privileged register when MSRpp=1 causes a Privi- 
leged Instruction type Program interrupt. 


Execution of this instruction specifying an SPR 
number that is not defined for the implementation 
causes either an Illegal Instruction type Program 
interrupt or one of the following. 


m if spro=0: boundedly undefined results 
m if spro=1: 
— if MSRpp=1: Privileged Instruction type 
Program interrupt 
— if MSRpp=0 and MSRyy=0: boundedly unde- 
fined results 
— if MSRpp=0 
results 


and MSRyy=1: undefined 


If the SPR field contains a value that is shown in 
Figure 11 but corresponds to an optional Special 
Purpose Register that is not provided by the imple- 
mentation, the effect of executing this instruction is 
the same as if the SPR number were not shown in the 
figure. 


Special Registers Altered: 
See Figure 11 


Compiler and Assembler Note 


For the mtspr and mfspr instructions, the SPR 
number coded in assembler language does not 
appear directly as a 10-bit binary number in the 
instruction. The number coded is split into two 
5-bit halves that are reversed in the instruction, 
with the high-order 5 bits appearing in bits 16:20 
of the instruction and the low-order 5 bits in bits 
11:15. This maintains compatibility with POWER 
SPR encodings, in which these two instructions 
have only a 5-bit SPR field occupying bits 11:15. 


SPR! Register Privi- 
decimal Spr5.9 Spro:4 Name leged 


1 00000 00001 
8 00000 01000 
9 00000 01001 
18 00000 10010 
19 00000 10011 
22 00000 10110 
25 00000 11001 
26 00000 11010 
27 00000 11011 
29 00000 11101 


152 00100 11000 


272 01000 10000 
273 01000 10001 
274 01000 10010 
275 01000 10011 
280 01000 11000 
282 01000 11010 


284 01000 11100 
285 01000 11101 


784-799 11000 1xxxx 
1013 11111 10101 


1 Note that the order of the two 5-bit halves 
of the SPR number is reversed. 

2 Part of the optional External Control facility 
(see Section 8.1). 

3 Part of the optional “Bridge” facility 
(see Section 9.1). 

4 Part of the optional Performance Monitor 
facility (see Appendix E). 

5 Part of the optional Data Address Breakpoint 
facility (see Section 8.2). 

6 This register is a hypervisor resource, and 
can be modified by this instruction only in 
hypervisor state (see Section 1.7). 


All SPR numbers not shown above, or in 
Figure 12, or in Book IV are reserved. 


Figure 11. SPR encodings for mtspr 


Programming Note 


For a discussion of software synchronization 
requirements when altering certain Special 
Purpose Registers, see Chapter 7, “Synchroniza- 
tion Requirements for Special Registers and for 
Lookaside Buffers” on page 71. 


Compatibility Note 


For a discussion of POWER compatibility with 
respect to SPR numbers not shown in the instruc- 
tion descriptions for mtspr and mfspr, see the 
appendix entitled “Incompatibilities with the 
POWER Architecture” in Book I, PowerPC AS User 
Instruction Set Architecture. 


Engineering Note 


Causing an interrupt if this instruction is executed 
specifying an SPR number that is not defined for 
the implementation facilitates the debugging of 
software. 
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Move From Special Purpose Register 
XFX-form 


mfspr RT,SPR 


31 RT spr 339 / 
0 6 11 21 31 


n € spre. | spros 
if length(SPREG(n)) = 64 then 
RT 4 SPREG(n) 


lse 
RT ¢ 320 | SPREG(n) 


The SPR field denotes a Special Purpose Register, 
encoded as shown in Figure 12. The contents of the 
designated Special Purpose Register are placed into 
register RT. For Special Purpose Registers that are 
32 bits long, the low-order 32 bits of RT receive the 
contents of the Special Purpose Register and the 
high-order 32 bits of RT are set to zero. 


spro=1 if and only if reading the register is privi- 
leged. Execution of this instruction specifying a 
defined and privileged register when MSRpp= 1 
causes a Privileged Instruction type Program inter- 
rupt. 


Execution of this instruction specifying an SPR 
number that is not defined for the implementation 
causes either an Illegal Instruction type Program 
interrupt or one of the following. 


m if spro=0: boundedly undefined results 
m if spro=1: 
— if MSRpp=1: Privileged Instruction type 
Program interrupt 
— if MSRpp=0: boundedly undefined results 


If the SPR field contains a value that is shown in 
Figure 12 but corresponds to an optional Special 
Purpose Register that is not provided by the imple- 
mentation, the effect of executing this instruction is 
the same as if the SPR number were not shown in the 
figure. 


Special Registers Altered: 
None 
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SPR! Register Privi- 
decimal Spr5.9 Spro:4 Name leged 


00000 00001 
00000 01000 
00000 01001 
00000 10010 
00000 10011 
00000 10110 
00000 11001 
00000 11010 
00000 11011 
00000 11101 


136 00100 01000 


272 01000 10000 
273 01000 10001 
274 01000 10010 
259,275 01000 n0011 
280 01000 11000 
282 01000 11010 
287 01000 11111 


768-799 11000 nxxxx perf mon“: 


1013 11111 10101 DABR 5 


1023 11111 11111 PIR 


1 Note that the order of the two 5-bit halves 
of the SPR number is reversed. 

2 Part of the optional External Control facility 
(see Section 8.1). 

3 Part of the optional “Bridge” facility 
(see Section 9.1). 

4 Part of the optional Performance Monitor 
facility (see Appendix E). 

5 Part of the optional Data Address Breakpoint 
facility (see Section 8.2). 

6 The ability to read SPRG3 in problem state is 
optional (see Section 3.3.3). If this ability is 
not provided by the implementation, SPR 
number 259 is treated as if it corresponded to 
an optional SPR that is not provided by the 
implementation. 

7 Reading the SPR is privileged if and only if 
n=1. 


Moving from the Time Base (TB and TBU) is 
accomplished with the mftb instruction, 
described in Book Il. 


All SPR numbers not shown above, or in 
Figure 11, or in Book IV are reserved. 


Note 
See the Notes that appear with mtspr. Elgure 145 SPR encodings:torimispr 
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Move To Machine State Register 
Doubleword X-form 


mtmsrd RS 


31 RS WI HI 178 / 
0 6 11 16 21 31 


MSRsg + (RS) 58 | (RS) 4g 
MSRag € (RS)so | (RS) 49 
MSRo:2 4:50 52:57 60:63 € (RS) 0:2 4:50 52:57 60:63 


The result of ORing bits 58 and 49 of register RS is 
placed into MSRsg. The result of ORing bits 59 and 49 
of register RS is placed into MSRso. Bits 0:2, 4:50, 
52:57, and 60:63 of register RS are placed into the 
corresponding bits of the MSR. 


This instruction is privileged. This instruction is exe- 
cution synchronizing except with respect to alter- 
ations to the LE bit; see Chapter 7, “Synchronization 
Requirements for Special Registers and for Lookaside 
Buffers” on page 71. 


In addition, alterations to the EE and RI bits are effec- 
tive as soon as the instruction completes. Thus if 
MSRee=0 and an External or Decrementer interrupt 
is pending, executing an mtmsrd instruction that sets 
MSRgg to 1 will cause the External or Decrementer 
interrupt to be taken before the next instruction is 
executed, if no higher priority exception exists (see 
Section 5.8, “Interrupt Priorities” on page 65). 


Special Registers Altered: 
MSR 


Programming Note 


If this instruction sets MSRpp to 1, it also sets 
MSR;p and MSRpp to 1. 


This instruction does not alter MSRyy or MSRye- 


Programming Note 


For a discussion of software synchronization 
requirements when altering certain MSR bits, see 
Chapter 7. 


Move From Machine State Register 
X-form 


mfmsr RT 

31 RT HH HH 83 / 
0 6 11 16 21 31 
RT € MSR 


The contents of the MSR are placed into register RT. 
This instruction is privileged. 


Special Registers Altered: 
None 
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4.4 Virtual Address Generation .... 28 
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4.4.2 SLB Search ............. 30 
4.5 Virtual to Real Translation ..... 31 


4.1 Storage Addressing 


A program references storage using the effective 
address computed by the processor when it executes 
a Load, Store, Branch, or Cache Management instruc- 
tion, or when it fetches the next sequential instruction. 
The effective address is translated to a real address 
according to procedures described in Section 4.3, 
“Address Translation Overview” on page 28 and fol- 


4.5.1 Page Table 
4.5.2 Storage Description Register 1 . 33 
4.5.3 Page Table Search ......... 34 
4.6 Data Address Compare 
4.7 Storage Control Bits 
4.7.1 Storage Control Bit Restrictions . 37 
4.7.2 Altering the Storage Control Bits 37 
4.8 Reference and Change Recording 37 


4.9 Storage Protection .......... 40 

4.9.1 Storage Protection, Address 
Translation Enabled ........... 40 

4.9.2 Storage Protection, Address 
Translation Disabled .......... 40 


4.10 Storage Control Instructions ... 41 
4.10.1 Cache Management Instructions 41 
4.10.2 Lookaside Buffer Management . 41 
4.10.2.1 SLB Management Instructions 42 
4.10.2.2 TLB Management Instructions 


(Optional) ................ 47 
4.11 Page Table Update 

Synchronization Requirements .... 49 
4.11.1 Page Table Updates ....... 49 
4.11.1.1 Adding a Page Table Entry .. 49 


4.11.1.2 Modifying a Page Table Entry 50 
4.11.1.3 Deleting a Page Table Entry . 50 


lowing sections. The real address is what is pre- 
sented to the storage subsystem. See Figure 13 on 
page 28. 


For a complete discussion of storage addressing and 
effective address calculation, see the section entitled 
“Storage Addressing” in Book |, PowerPC AS User 
Instruction Set Architecture. 
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Storage Control Overview 


m Real address space size is 2" bytes, m< 62; see 
Note 1. 


m Real page size is 21? bytes (4 KB). 
m Effective address space size is 26% bytes. 


m An effective address is translated to a virtual 
address via the Segment Lookaside Buffer (SLB). 


— Virtual address space size is 2" bytes, 
65< n< 80; see Note 2. 

— Segment size is 228 bytes (256 MB). 

— Number of virtual segments is 27-28; see 
Note 2. 

— Virtual page size is 2° bytes, 12<p< 28; two 
sizes are supported simultaneously, 4 KB 
(p=12) and a larger size; see Note 3. 


m A virtual address is translated to a real address 
via the Page Table. 


Notes: 


1. The value of m is implementation-dependent 
(subject to the maximum given above). When 
used to address storage, the high-order 62-m 
bits of the “62-bit” real address must be zeros. 

2. The value of n is implementation-dependent 
(subject to the range given above). In references 
to 80-bit virtual addresses elsewhere in this Book, 
the high-order 80-n bits of the “80-bit” virtual 
address are assumed to be zeros. 

3. The value of p for the larger virtual page size is 
implementation-dependent (subject to the range 
given above). 


4.2 Storage Model 


The storage model provides the following features. 


1. The architecture allows the storage implementa- 
tions to take advantage of the performance bene- 
fits of weak ordering of storage accesses between 
processors or between processors and |/O 
devices. 


2. The architecture provides instructions that allow 
the programmer to ensure a consistent and 
ordered storage state. 


* dcbf + Iwarx 
+ dcbst + Iwsync 
* eieio * stdcx. 
* icbi * stwex. 
e isync * sync 

+ Idarx + tibsync 


3. Storage accesses appear to be performed in 
program order with respect to the processor per- 
forming them but, in general, may be performed 
in different orders with respect to other 
processors and mechanisms. 


4. Storage consistency between processors, and 
between a processor and an I/O device, is con- 
trolled by software using the “WIM” storage 
control bits (see Section 4.7). These bits allow 
software to control whether a given storage 
location has any of the following attributes. 


m Write Through Required (W) 
m Caching Inhibited (1) 
m Memory Coherence Required (M) 


Engineering Note 


The architecture does not suggest or preclude any 
implementation of storage consistency supporting 
the features listed above. In particular, the imple- 
mentation may be a snoopy bus design, a central- 
ized cache directory design, or other design. 


4.2.1 Storage Exceptions 


A storage exception is an exception that causes an 
Instruction Storage interrupt, an Instruction Segment 
interrupt, a Data Storage interrupt, a Data Segment 
interrupt, or an Alignment interrupt. Attempting to 
fetch or execute an instruction causes a storage 
exception if certain conditions apply. Such conditions 
include the following. 


m The appropriate relocate bit in the MSR is set to 
1 and the effective address cannot be translated 
to a real address. 

m The access is not permitted by the storage pro- 
tection mechanism. 

m The access causes a Data Address Compare 
match or a Data Address Breakpoint match. 


In certain cases a storage exception may result in the 
“restart” of (re-execution of at least part of) a Load or 
Store instruction. See the section entitled “Instruction 
Restart” in Book Il, PowerPC AS Virtual Environment 
Architecture, and Section 5.6, “Partially Executed 
Instructions” on page 63 in this Book. 
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4.2.2 Instruction Fetch 
Instructions are fetched under control of MSRij,». 


The effective address of the instruction is inter- 
preted as described in Section 4.2.6, “Real 
Addressing Mode” on page 26. 


The effective address of the instruction is trans- 
lated by the Address Translation mechanism. (lf it 
cannot be translated, a storage exception occurs.) 


4.2.2.1 Implicit Branch 


Explicitly altering certain MSR bits (using mtmsr{ d]), 
or explicitly altering SLB entries, Page Table entries, 
or certain System Registers, may have the side effect 
of changing the addresses, effective or real, from 
which the current instruction stream is being fetched. 
This side effect is called an implicit branch. For 
example, an mtmsrd instruction that changes the 
value of MSRs may change the effective addresses 
from which the current instruction stream is being 
fetched. The MSR bits and System Registers for 
which alteration can cause an implicit branch are indi- 
cated as such in Chapter 7, “Synchronization 
Requirements for Special Registers and for Lookaside 
Buffers” on page 71. Implicit branches are not sup- 
ported by the PowerPC AS Architecture. If an implicit 
branch occurs, the results are boundedly undefined. 


4.2.3 Data Access 


Data accesses are controlled by MSRpp. 


The effective address of the data is interpreted as 
described in Section 4.2.6, “Real Addressing 
Mode” on page 26. 


The effective address of the data is translated by 


the Address Translation mechanism. (If it cannot 
be translated, a storage exception occurs.) 


4.2.4 Performing Operations 
Out-of-Order 


An operation is said to be performed “in-order” if, at 
the time that it is performed, it is known to be 
required by the sequential execution model. An oper- 
ation is said to be performed “out-of-order” if, at the 
time that it is performed, it is not known to be 
required by the sequential execution model. 


Architecture Note 


In earlier versions of the architecture specifica- 
tion, “speculative” was used instead of “out-of- 
order”. The terminology was changed to be 
consistent with the technical literature, where 
“speculative execution” often means the exe- 
cution of instructions past unresolved branches 
and “out-of-order execution” means execution of 
an instruction before it is known to be required by 
the sequential execution model. Because the 
meaning of “speculative” in the literature differs 
from ordinary English usage the term would cause 
confusion no matter how the architecture specifi- 
cation defined it, so the term is no longer used 
here at all. 


Operations are performed out-of-order by the hard- 
ware on the expectation that the results will be 
needed by an instruction that will be required by the 
sequential execution model. Whether the results are 
really needed is contingent on everything that might 
divert the control flow away from the instruction, such 
as Branch, Trap, System Call, and rfid instructions, 
and interrupts, and on everything that might change 
the context in which the instruction is executed. 


Typically, the hardware performs operations out-of- 
order when it has resources that would otherwise be 
idle, so the operation incurs little or no cost. If subse- 
quent events such as branches or interrupts indicate 
that the operation would not have been performed in 
the sequential execution model, the processor aban- 
dons any results of the operation (except as described 
below). 


In the remainder of this section, including its sub- 
sections, “Load instruction” includes the Cache Man- 
agement and other instructions that are stated in the 
instruction descriptions to be “treated as a Load”, and 
similarly for “Store instruction”. 


Most operations can be performed out-of-order, as 
long as the machine appears to follow the sequential 
execution model. Certain out-of-order operations are 
restricted, as follows. 
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m Stores 


Stores are performed in-order (even if the Store 
instructions that caused them were executed out- 
of-order). 


m Accessing Guarded Storage 


The restrictions for this case are given in Section 
4.2.4.2. 


No error of any kind other than Machine Check may 
be reported due to an operation that is performed 
out-of-order, until such time as it is known that the 
operation is required by the sequential execution 
model. The only other permitted side effects (other 
than Machine Check) of performing an operation out- 
of-order are the following. 


m Reference and Change bits may be set as 
described in Section 4.8, “Reference and Change 
Recording” on page 37. 


m Non-Guarded storage locations that could be 
fetched into a cache by in-order execution may 
be fetched out-of-order into that cache. 


Engineering Note 


Out-of-order execution of the stwcx. and stdcx. 
instructions is extremely complex and is not 
recommended. 


Engineering Note 


Because an External or Decrementer exception 
can become pending at any time, it might seem 
that if MSRgg=1 then fetching or executing any 
instruction beyond the current instruction is an 
out-of-order operation. However, these oper- 
ations need not be treated as out-of-order if the 
taking of the interrupt is delayed until after they 
have completed. Similar considerations apply to 
Floating-Point Enabled Exception type Program 
interrupts when one of the Imprecise floating- 
point exception modes is in effect. 


Engineering Note 


Implementations that perform operations out-of- 
order must take care to obey the sequential exe- 
cution model except as permitted by the 
architecture. Examples of cases that may require 
special attention include the following. 


m Changes of control flow, including sc, Trap, 
rfid, and interrupts as well as branches. 


m Changes of context due to changes of control 
flow. For example, the code at a branch 
target location, or the handler for System Call 
or Trap interrupts, may change the context 
and then return, so that the instructions 
immediately following the Branch, se, or Trap 
execute in a new context. 


m Changes to resources that affect address 
translation, storage protection, or storage 
control attributes, when the change is fol- 
lowed by the appropriate software synchroni- 
zation. Such resources include 
MSRsr prin DR SDR1, EAR, Page Tables, 
SLBs, and TLBs. 


m Execution synchronizing and context synchro- 
nizing operations. 


4.2.4.1 Guarded Storage 


Storage is said to be “well-behaved” if the corre- 
sponding real storage exists and is not defective, and 
if the effects of a single access to it are indistinguish- 
able from the effects of multiple identical accesses to 
it. Data and instructions can be fetched out-of-order 
from well-behaved storage without causing undesired 
side effects. 


Storage is said to be Guarded if either of the following 
conditions is satisfied. 


m MSR bit IR or DR is 1 for instruction fetches or 
data accesses respectively, and the G bit is 1 in 
the relevant Page Table Entry. 


m MSR bit IR or DR is 0 for instruction fetches or 
data accesses respectively, and the optional Real 
Mode Storage Control facility (see Section 8.3) is 
not implemented. In this case all of storage is 
Guarded for the corresponding accesses. 


In general, storage that is not well-behaved should be 
Guarded. Because such storage may represent a 
control register on an I/O device or may include 
locations that do not exist, an out-of-order access to 
such storage may cause an I/O device to perform 
unintended operations or may result in a Machine 
Check. 


The following rules apply to in-order execution of 
Load and Store instructions for which the first byte of 
the storage operand is in storage that is both Caching 
Inhibited and Guarded. 
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m Load or Store instruction that causes an atomic 
access 


If any portion of the storage operand has been 
accessed and an External, Decrementer, or 
Imprecise mode Floating-Point Enabled exception 
is pending, the instruction completes before the 
interrupt occurs. 


m Load or Store instruction that causes an Align- 
ment exception, or that causes a Data Storage 
exception for reasons other than Data Address 
Compare match or Data Address Breakpoint 
match 


The portion of the storage operand that is in 
Caching Inhibited and Guarded storage is not 
accessed. 


(The corresponding rules for instructions that 
cause a Data Address Compare match or Data 
Address Breakpoint match are given in Sections 
4.6 and 8.2 respectively.) 


Architecture Note 


The rules for accessing Guarded storage when an 
Imprecise mode Floating-Point Enabled exception 
is pending should be revisited when the architec- 
ture is clarified with respect to those modes. For 
example, it may be acceptable to require software 
synchronization between any instruction that 
could cause a floating-point enabled exception in 
Imprecise mode and a subsequent instruction that 
accesses Guarded storage. (A Floating-Point 
Status and Control Register instruction might 
provide sufficient synchronization.) 


4.2.4.2 Out-of-Order Accesses to 
Guarded Storage 


In general, Guarded storage is not accessed out-of- 
order. The only exceptions to this rule are the fol- 
lowing. 


Load Instruction 


If a copy of any byte of the storage operand is in a 
cache then that byte may be accessed in the cache or 
in main storage. 


Instruction Fetch 


If MSR,p=0 then an instruction may be fetched if any 
of the following conditions are met. 


1. The instruction is in a cache. In this case it may 
be fetched from the cache or from main storage. 


2. The instruction is in a real page from which an 
instruction has previously been fetched, except 
that if that previous fetch was based on condition 
1 then the previously fetched instruction must 
have been in the instruction cache. 


3. The instruction is in the same real page as an 
instruction that is required by the sequential exe- 
cution model, or is in the real page immediately 
following such a page. 


Programming Note 


Software should ensure that only well-behaved 
storage is copied into a cache, either by 
accessing as Caching Inhibited (and Guarded) all 
storage that may not be well-behaved, or by 
accessing such storage as not Caching Inhibited 
(but Guarded) and referring only to cache blocks 
that are well-behaved. 


If a real page contains instructions that will be 
executed when MSR,p=0, software should ensure 
that this real page and the next real page contain 
only well-behaved storage (or, if the optional Real 
Mode Storage Control Facility is implemented, 
that this real page is not Guarded). 


Engineering Note 


When MSR;p=0 or MSRpp=0, performance may 
be significantly degraded because all of storage 
defaults to being Guarded for the corresponding 
accesses. If it is important to avoid this degrada- 
tion, a means of specifying portions of real 
storage that are treated as non-Guarded in real 
addressing mode should be provided as described 
in Section 8.3, “Real Mode Storage Control” on 
page 77. 


4.2.5 32-Bit Mode 


The computation of the 64-bit effective address is 
independent of whether the processor is in 32-bit 
mode or 64-bit mode. In 32-bit mode (MSRgp=0), the 
high-order 32 bits of the 64-bit effective address are 
treated as zeros for the purpose of addressing 
storage. This applies to both data accesses and 
instruction fetches. It applies independent of whether 
address translation is enabled or disabled. This trun- 
cation of the effective address is the only respect in 
which storage accesses in 32-bit mode differ from 
those in 64-bit mode. 
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Programming Note 


Treating the high-order 32 bits of the effective 
address as zeros effectively truncates the 64-bit 
effective address to a 32-bit effective address 
such as would have been generated on a 32-bit 
implementation of the PowerPC Architecture. 
Thus, for example, the ESID in 32-bit mode is the 
high-order four bits of this truncated effective 
address; the ESID thus lies in the range 0-15. 
When address translation is enabled, these four 
bits would select a Segment Register on a 32-bit 
implementation of the PowerPC Architecture. The 
SLB entries that translate these 16 ESIDs can be 
used to emulate these Segment Registers. 


4.2.6 Real Addressing Mode 


Instruction fetches are performed in “real addressing 
mode” if instruction address translation is disabled 
(MSR,p=0). Data accesses are performed in real 
addressing mode if data address translation is disa- 
bled (MSRpp=0). Storage accesses in real 
addressing mode are performed in a manner that 
depends on the contents of MSRyy, LPES, and the 
RMLR and RMOR (see Section 1.7, “Logical Parti- 
tioning (LPAR)” on page 4), as described below. In 
all cases, bits 0:1 of the effective address are ignored 
and, on implementations that support a real address 
size of only m bits, m< 62, bits 2:63- m of the effective 
address may be ignored. 


m If MSRyy=1, bits 2:63 of the effective address 
are used as the real address for the access. 


m If MSRyy=0 and LPES=0, the access causes a 
storage exception as described in Section 4.9.2, 
“Storage Protection, Address Translation 
Disabled” on page 40. 


m If MSRyy=0 and LPES=1, the Offset Real Mode 
Address mechanism, described in Section 4.2.6.1, 
controls the access. 


4.2.6.1 Offset Real Mode Address 


If MSRyy=0 and LPES=1, the access is controlled by 
the contents of the Real Mode Limit Register and 
Real Mode Offset Register, as follows. 


Real Mode Limit Register (RMLR) 


If bits 2:63 of effective address for the access are 
greater than or equal to the value (limit) represented 
by the contents of the RMLR, the access causes a 
storage exception (see Section 4.9.2). The RMLR sup- 
ports effective address limits that are powers of 2. 
The number and values of the limits supported are 
implementation-dependent. 


Real Mode Offset Register (RMOR) 


If the access is permitted by the RMLR, the effective 
address for the access is ORed with the offset repres- 
ented by the contents of the RMOR and the low-order 
m bits of the result are used as the real address for 
the access. The number and values of the offsets 
supported are implementation-dependent. 


Programming Note 


The offset specified by the RMOR should be a 
non-zero multiple of the limit specified by the 
RMLR. If these registers are set thus, ORing the 
effective address with the offset produces a result 
that is equivalent to adding the effective address 
and the offset. (The offset must not be zero, 
because real page 0 contains the fixed interrupt 
vectors and real pages 1 and 2 may be used for 
implementation-specific purposes; see Section 
4.2.7, “Address Ranges Having Defined Uses” on 
page 27.) 


Engineering Note 


Ignoring bits 2:63—m of the effective address sim- 
plifies the real mode limit check. Specifically, if 
the minimum limit value supported by the imple- 
mentation is 2k, only bits 64- m:63-K of the effec- 
tive address need be checked. 
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4.2.6.2 Storage Control Attributes for 
Real Addressing Mode and for Implicit 
Storage Accesses 


Storage accesses in real addressing mode are per- 
formed as though all of storage had the following 
storage control attributes, except as modified by the 
optional Real Mode Storage Control facility (see 
Section 8.3) if that facility is implemented. (The 
storage control attributes are defined in Book Il, 
PowerPC AS Virtual Environment Architecture.) 


m not Write Through Required 

m not Caching Inhibited, for instruction fetches 

m not Caching Inhibited, for data accesses if the 
Real Mode Caching Inhibited bit is set to 0; 
Caching Inhibited, for data accesses if the Real 
Mode Caching Inhibited bit is set to 1 

m Memory Coherence Required, for data accesses 

m Guarded 


Implicit accesses to the Page Table by the processor 
in performing address translation and in recording 
reference and change information are performed as 
though the storage occupied by the Page Table had 
the following storage control attributes. 


not Write Through Required 
not Caching Inhibited 
Memory Coherence Required 
m not Guarded 


These implicit accesses are ordered by the sync 
instruction in the same manner as are explicit storage 
accesses. 


Software must ensure that any data storage location 
that is accessed with the Real Mode Caching Inhibited 
bit set to 1 is not in the caches. 


Software must ensure that the Real Mode Caching 
Inhibited bit contains 0 whenever data address trans- 
lation is enabled and whenever the processor is not in 
hypervisor state. 


Programming Note 


Because storage accesses in real addressing 
mode do not use the SLB or the Page Table, 
accesses in this mode bypass all checking and 
recording of information contained therein (e.g., 
storage protection checks that use information 
contained therein are not performed, and refer- 
ence and change information is not recorded). 


The Real Mode Caching Inhibited bit can be used 
to permit a control register on an I/O device to be 
accessed without permitting the corresponding 
storage location to be copied into the caches. 
The bit should normally contain O. Software 


would set the bit to 1 just before accessing the 
control register, access the control register as 
needed, and then set the bit back to 0. 


4.2.7 Address Ranges Having 
Defined Uses 


The address ranges described below have uses that 
are defined by the architecture. 


m Fixed interrupt vectors 


Except for the first 256 bytes, which are reserved 
for software use, the real page beginning at real 
address 0x0000 0000 0000 0000 is either used 
for interrupt vectors or reserved for future inter- 
rupt vectors. 


m Implementation-specific use 


The two contiguous real pages beginning at real 
address 0x0000 0000 0000 1000 are reserved for 
implementation-specific purposes. 


= Offset Real Mode interrupt vectors 


The real page beginning at the real address spec- 
ified by the RMOR is used similarly to the page 
for the fixed interrupt vectors. 


m Page Table 


A contiguous sequence of real pages beginning at 
the real address specified by SDR1 contains the 
Page Table. 


4.2.8 Invalid Real Address 


A storage access (including an access that is per- 
formed out-of-order; see Section 4.2.4) may cause a 
Machine Check if the accessed storage location con- 
tains an uncorrectable error or does not exist. In the 
latter case the Checkstop state may be entered. See 
Section 5.5.2, “Machine Check Interrupt” on page 55. 


Programming Note 


Hypervisor software must ensure that a storage 
access by a program in one partition will not 
cause a Checkstop or other system-wide event 
that could affect the integrity of other partitions 
(see Section 1.7, “Logical Partitioning (LPAR)” on 
page 4). For example, such an event could occur 
if a real address placed in a Page Table Entry or 
made accessible to a partition using the Offset 
Real Mode Address mechanism (see Section 
4.2.6.1) does not exist. 
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4.3 Address Translation 
Overview 


The effective address (EA) is the address generated 
by the processor for an instruction fetch or for a data 
access. If address translation is enabled (MSR,p= 1 
or MSRpp= 1 as appropriate), this address is passed 
to the Address Translation mechanism, which 
attempts to convert the address to a real address 
which is then used to access storage. 


The first step in address translation is to convert the 
effective address to a virtual address (VA), as 
described in Section 4.4. The second step, conversion 
of the virtual address to a real address (RA), is 
described in Section 4.5. 


If the effective address cannot be translated, a 
storage exception (see Section 4.2.1) occurs. 


Figure 13 gives an overview of the address trans- 
lation process. 


Effective Address 


| 


Lookup in 
SLB 


| 


Virtual Address 


| 


Lookup in 
Page Table 


| 


Real Address 


Figure 13. Address translation overview 


4.4 Virtual Address Generation 


Conversion of a 64-bit effective address to a virtual 
address is done by searching the Segment Lookaside 
Buffer (SLB) as shown in Figure 14. 


64-bit Effective Address 


36 28-p p 
ESID Page Byte 
2 e É Ser gs a 
Segment Lookaside 
Buffer (SLB) 
SLBEO ESID V| VSID KsKpNLC 
SLBEn 
0 35 31 88 89 93 
52 28-p p 
VSID Page Byte 


4 Virtual Page Number (VPN) > 


80-bit Virtual Address 


Figure 14. Translation of 64-bit effective address to 80-bit 
virtual address 
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4.4.1 Segment Lookaside Buffer 
(SLB) 


The Segment Lookaside Buffer (SLB) specifies the 
mapping between Effective Segment IDs (ESIDs) and 
Virtual Segment IDs (VSIDs). The number of SLB 
entries is implementation-dependent, except that all 
implementations provide at least 32 entries. 


The contents of the SLB are managed by software, 
using the instructions described in Section 4.10.2.1, 
“SLB Management Instructions” on page 42. See 
Chapter 7, “Synchronization Requirements for Special 
Registers and for Lookaside Buffers” on page 71 for 
the rules that software must follow when updating the 
SLB. 


SLB Entry 


Each SLB entry (SLBE) maps one ESID to one VSID. 
Figure 15 shows the layout of an SLB entry. 


0 35 37 89 93 


Bit(s) Name Description 
0:35 ESID Effective Segment ID 
36 V Entry valid (V=1) or 
invalid (V=0) 
37:88 VSID Virtual Segment ID 
89 K; Supervisor (privileged) state 
storage key 
90 Kp Problem state storage key 
91 N No-execute segment if N=1 
92 L Virtual pages are large (L=1) 
or 4 KB (L=0) 
9 C Class 


Figure 15. SLB Entry 


On implementations that support a virtual address 
size of only n bits, n< 80, bits 0:79- n of the VSID field 
are treated as reserved bits, and software must set 
them to zeros. 


A No-execute segment (N=1) contains data that 
should not be executed. 


The L bit selects between two virtual page sizes, 4 KB 
(p=12) and “large”. The large page size is an imple- 
mentation-dependent value that is a power of 2 and is 
in the range 8 KB : 256 MB (13< p< 28). Some imple- 
mentations may provide a means by which software 
can select the large page size from a set of several 
implementation-dependent sizes during system initial- 
ization. 


If “large page” is used in reference to real storage, it 
means the sequence of contiguous real (4 KB) pages 
to which a large virtual page is mapped. 


The Class field is used in conjunction with the slbie 
instruction (see Section 4.10.2.1). 


Software must ensure that the SLB contains at most 
one entry that translates a given effective address 
(i.e., that a given ESID is contained in no more than 
one SLB entry). 


Programming Note 


Because the virtual page size is used both in 
searching the Page Table and in forming the real 
address using the matching Page Table Entry 
(PTE) (see Section 4.5, “Virtual to Real 
Translation” on page 31), and PTEs contain no 
indication of the virtual page size, the virtual page 
size must be the same for all address translations 
that use a given VSID value. This has the fol- 
lowing consequences, which apply collectively to 
all processors that use the same Page Table. 


m The value of the L bit must be the same in all 
SLB entries that contain a given VSID value. 


m Before changing the value of the L bit in an 
SLB entry, software must invalidate all SLB 
entries, TLB entries, and PTEs that contain 
the corresponding VSID value. 


Engineering Note 


lt is suggested that implementations provide a 
mechanism by which software can select one of 
three different large page sizes. For example, an 
implementation might provide large page sizes of 
64 KB, 1 MB, and 16 MB. Because this selection 
will be changed very infrequently (i.e., only during 
system initialization), the selection mechanism 
need not be directly accessible to software. 


Architecture Note 


If additional SLB entry fields are defined in the 
future, consideration should be given to retaining 
the potential to enlarge the Class field. Such 
enlargement would be in the low-order direction 
(i.e., the current Class bit would become the high- 
order bit of the enlarged Class field). Related 
considerations affect the slbie, slbmte, and 
slbmfev instructions. 


Consideration should also be given to retaining 
the property that the Class value returned by 
slbmfev can be inserted into the register con- 
taining the ESID for sibie using a single instruc- 
tion. 
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4.4.2 SLB Search 


When the hardware searches the SLB, all entries are 
tested for a match with the EA. For a match to exist, 
the following must be true: 


= SLBEy= 1 
m SLBEgsip = EAo:5 


If the SLB search succeeds, the virtual address (VA) 
is formed by concatenating the VSID from the 
matching SLB entry with bits 36:63 of the EA. 


The Virtual Page Number (VPN) is bits 0:79-p of the 
virtual address. 


If the SLB search fails, a segment fault occurs. This is 
an Instruction Segment exception or a Data Segment 
exception, depending on whether the effective 
address is for an instruction fetch or for a data 
access. 
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4.5 Virtual to Real Translation 


Conversion of an 80-bit virtual address to a real address is done by searching the Page Table as shown in 
Figure 16. 


| 80-p— PA 
80-bit Virtual Address | Virtual Page Number (VPN) | Byte | 
L i J 
0 13 51 52 19-p 80-p 79 
L 7 IL IL | J 
| i} 
HTABORG HTABSIZE 4 
ry 44 13 5 r 11+p— 28-p— 
SDRL TH] ati A ta ewes xx000..... 00 | JH | | | Os | | 
Li i i | L i | 
0 2 17 18 45 59 63 0 38 
L 7 jòi i ] | L J 
| > 
+ | 
m~~ + + 
| | == 
| DECODE | | | 
| | | Hash Function | 
a | | 
L ~ 
| | 
Mask 4 4 
f 284 i 39 
| 000..... 00111..... 1 | | | 
L ii | 
0 27 0 27 28 38 
L J L It J 
| | | 
kres oS 
| | 
Lo Lo 
| | 
=a | aw | 
| | 
| 
———_! 
| 
| | 
| | 
| | 
4 4 | 
>A | Page Table 
| | | 
| o | | — + 16 bytes 
| | | / | T T T T T T T 1 
Es | / PTEO PTE? | PTEGO 
| | / 
+ + + / 
r 16— 28— 114 m | 
| | | [0000000 | - - - = = - - = É 
L i i i LA 
\ 
62-bit Real Address of Page Table Entry Group AS 
\ PTEGn 
\ | 
< 128 bytes > 
Page Table Entry (PTE) 
16 bytes 
| Ca II PE RR É CU CI ză PR ot TE 
| AVPN | sw |/ |a |v || Z| /| Real Page Number (RPN) | // |ac|R |c | mimo | | PP | 
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0 56 60 62 63 0 1 2 63-p 52 54 55 56 57 60 61 63 
L ] 
T 
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r 62-p-7 PA 
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Figure 16. Translation of 80-bit virtual address to 62-bit real address 
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4.5.1 Page Table 


The Hashed Page Table (HTAB) is a variable-sized 
data structure that specifies the mapping between 
Virtual Page Numbers and Real Page Numbers. The 
HTAB's size must be a multiple of 4 KB, its starting 
address must be a multiple of its size, and it must be 
located in storage having the storage control attri- 
butes that are used for implicit accesses to it (see 
Section 4.2.6.2). 


The HTAB contains Page Table Entry Groups (PTEGs). 
A PTEG contains 8 Page Table Entries (PTEs) of 16 
bytes each; each PTEG is thus 128 bytes long. PTEGs 
are entry points for searches of the Page Table. 


See Section 4.11, “Page Table Update Synchroniza- 
tion Requirements” on page 49 for the rules that soft- 
ware must follow when updating the Page Table. 


Programming Note 


The Page Table must be treated as a hypervisor 
resource (see Section 1.7, “Logical Partitioning 
(LPAR)” on page 4), and therefore must be placed 
in real storage to which only the hypervisor has 
write access. Moreover, the contents of the Page 
Table must be such that non-hypervisor software 
cannot modify storage that contains hypervisor 
programs or data. Finally, to protect against 
incorrect use of the L bit of SLB entries by non- 
hypervisor software, real storage that is mapped 
by the Page Table must be allocated to partitions 
in units each of which has a size that is a multiple 
of 2P bytes and is aligned at a 2P byte boundary, 
where 2P is the maximum large page size for any 
processor in the system. (Incorrect use of the L 
bit could cause the virtual address for a large 
virtual page to be translated using a PTE that was 
created to translate a 4 KB virtual page. If 2P 
were the maximum large page size for the parti- 
tion, instead of for the system, it might be neces- 
sary to change a processor's large page size as 
part of reassigning the processor to a different 
partition.) 


Page Table Entry 


Each Page Table Entry (PTE) maps one VPN to one 
RPN. Figure 17 shows the layout of a PTE. 


62 63 


IN ET 


PI en CG 


0 1 2 51 54 55 5657 606162 63 
Dword Bit(s) Name Description 
0 0:56 AVPN Abbreviated Virtual Page 
Number 
57:60 SW Available for software use 
62 H Hash function identifier 
63 V Entry valid (V=1) 
or invalid (V=0) 
2:51 RPN Real Page Number 
54 AC Address Compare bit 
55 R Reference bit 
56 C Change bit 
57:60 WIMG Storage control bits 
61 N No-execute page if N=1 
62:63 PP Page protection bits 


All other fields are reserved. 


Figure 17. Page Table Entry 


If p< 23, the Abbreviated Virtual Page Number (AVPN) 
field contains bits 0:56 of the VPN. Otherwise bits 
0:79%-p of the AVPN field contain bits 0:79-p of the 
VPN, and bits 80-p:56 of the AVPN field must be 
zeros. 


Programming Note 


If p< 23, the AVPN field omits the low-order 23- p 
bits of the VPN. These bits are not needed in the 
PTE, because the low-order 11 bits of the VPN are 
always used in selecting the PTEGs to be 
searched (see Section 4.5.3). 


On implementations that support a virtual address 
size of only n bits, n< 80, bits 0:79—n of the AVPN field 
must be zeros. 


The RPN field contains the page number of the real 
page that contains the first byte of the block of real 
storage to which the virtual page is mapped. If p> 12, 
the low-order p- 12 bits of the RPN field (bits 64- p:51 
of doubleword 1 of the PTE) must be 0. On implemen- 
tations that support a real address size of only m bits, 
m< 62, bits 0:61— m of the RPN field must be zeros. 


Programming Note 


For a large virtual page, the high-order 62—p bits 
of the RPN field (bits 0:61—p) comprise the large 
real page number. 
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Engineering Note 


The requirement that if p> 12 the low-order p- 12 
bits of the RPN field must be O permits bits 34:49 
of the 62-bit real address to be formed by ORing 
RPNaq:qg with 28-P0 | VPNeo_p:7 (equivalently, by 
ORing RPNaq:4g with 28-P0 | EAgy p:51): instead of 


by concatenating as described in Section 4.5.8. 
(To protect against incorrect use of the L bit of 
SLB entries by non-hypervisor software, bits 34:49 
of the 62-bit real address must not be formed by 
adding the two components.) 


A No-execute page (N=1) contains data that should 
not be executed. 


Page Table Size 


The number of entries in the Page Table directly 
affects performance because it influences the hit ratio 
in the Page Table and thus the rate of page faults. If 
the table is too small, it is possible that not all the 
virtual pages that actually have real pages assigned 
can be mapped via the Page Table. This can happen 
if too many hash collisions occur and there are more 
than 16 entries for the same primary/secondary pair 
of PTEGs. While this situation cannot be guaranteed 
not to occur for any size Page Table, making the Page 
Table larger than the minimum size (see Section 
4.5.2) will reduce the frequency of occurrence of such 
collisions. 


Programming Note 


If large pages are not used, it is recommended 
that the number of PTEGs in the Page Table be at 
least half the number of real pages to be 
accessed. For example, if the amount of real 
storage to be accessed is 23! bytes (2 GB), then 
we have 29'-12-219 real pages. The minimum 


recommended Page Table size would be 218 
PTEGs, or 225 bytes (32 MB). 


4.5.2 Storage Description Register 1 


The SDR1 register is shown in Figure 18. 


HTABORG //| |HTABSIZE 


0 2 45 59 63 


Bits Name 
2:45  HTABORG 
59:63 HTABSIZE 


All other fields are reserved. 
Figure 18. SDR1 


Description 
Real address of Page Table 
Encoded size of Page Table 


SDR1 is a hypervisor resource; see Section 1.7, 
“Logical Partitioning (LPAR)” on page 4. 


The HTABORG field in SDR1 contains the high-order 
44 bits of the 62-bit real address of the Page Table. 
The Page Table is thus constrained to lie on a 218 byte 
(256 KB) boundary at a minimum. At least 11 bits 
from the hash function (see Figure 16 on page 31) 
are used to index into the Page Table. The minimum 
size Page Table is 256 KB (2!! PTEGs of 128 bytes 
each). 


The Page Table can be any size 2" bytes where 
18<n<46. As the table size is increased, more bits 
are used from the hash to index into the table and the 
value in HTABORG must have more of its low-order 
bits equal to 0. 


The HTABSIZE field in SDR1 contains an integer 
giving the number of bits (in addition to the minimum 
of 11 bits) from the hash that are used in the Page 
Table index. This number must not exceed 28. 
HTABSIZE is used to generate a mask of the form 
0b00...011...1, which is a string of 28 — HTABSIZE 
O-bits followed by a string of HTABSIZE 1-bits. The 
1-bits determine which additional bits (beyond the 
minimum of 11) from the hash are used in the index 
(see Figure 16 on page 31). The number of low-order 
O bits in HTABORG must be greater than or equal to 
the value in HTABSIZE. 


On implementations that support a real address size 
of only m bits, m<62, bits 0:61-m of the HTABORG 
field are treated as reserved bits, and software must 
set them to zeros. 


Chapter 4. Storage Control 33 


IBM Confidential - Feb. 24, 1999 


Programming Note 


Let n equal the virtual address size (in bits) sup- 
ported by the implementation. If n<67, software 
should set the HTABSIZE field to a value that 
does not exceed n-39. Because the high-order 
80-n bits of the VSID are assumed to be zeros, 
the hash value used in the Page Table search will 
have the high-order 67-n bits either all Os 
(primary hash; see Section 4.5.3) or all 1s (sec- 
ondary hash). If HTABSIZE> n- 39, some of these 
hash value bits will be used to index into the Page 
Table, with the result that certain PTEGs will not 
be searched. 


Engineering Note 


Because software must ensure that the number of 
low-order O bits in HTABORG is greater than or 
equal to the value in HTABSIZE, the 62-bit real 
address of the PTEG can be formed by ORing the 
various components. 


Example: 


Suppose that the Page Table is 16,384 (21%) 128-byte 
PTEGs, for a total size of 22! bytes (2 MB). A 14-bit 
index is required. Eleven bits are provided from the 
hash to start with, so 3 additional bits from the hash 
must be selected. Thus the value in HTABSIZE must 
be 3 and the value in HTABORG must have its low- 
order 3 bits (bits 43:45 of SDR1) equal to O. This 
means that the Page Table must begin on a 23+11+7 
= 221 = 2 MB boundary. 


4.5.3 Page Table Search 


When the hardware searches the Page Table, the 
accesses are performed as described in Section 4.2.6, 
“Real Addressing Mode” on page 26. 


An outline of the HTAB search process is shown in 
Figure 16 on page 31. The detailed algorithm is as 
follows. 


1. Primary Hash: 
A 39-bit hash value is computed by Exclusive 
ORing bits 13:51 of the VPN with a 39-bit value 
formed by concatenating 11+p O-bits with the 
low-order 28-p bits of the VPN. The 62-bit real 
address of a PTEG is formed by concatenating the 
following values: 


= Bits 2:17 of SDR1 (the high-order 16 bits of 
HTABORG). 

m Bits 0:27 of the 39-bit hash value ANDed with 
the mask generated from bits 59:63 of SDR1 
(HTABSIZE) and then ORed with bits 18:45 of 
SDR1 (the low-order 28 bits of HTABORG). 

m Bits 28:38 of the 39-bit hash value. 

m Seven 0-bits. 


This operation identifies a particular PTEG, called 
the “primary PTEG”, whose eight PTEs will be 
tested. 


2. Secondary Hash: 

A 39-bit hash value is computed by taking the 
one's complement of the Exclusive OR of bits 
13:51 of the VPN with a 39-bit value formed by 
concatenating 11+p O-bits with the low-order 
28- p bits of the VPN. The 62-bit real address of 
a PTEG is formed by concatenating the following 
values: 


m Bits 2:17 of SDR1 (the high-order 16 bits of 
HTABORG). 

m Bits 0:27 of the 39-bit hash value ANDed with 
the mask generated from bits 59:63 of SDR1 
(HTABSIZE) and then ORed with bits 18:45 of 
SDR1 (the low-order 28 bits of HTABORG). 

m Bits 28:38 of the 39-bit hash value. 

m Seven 0-bits. 


This operation identifies the “secondary PTEG”. 


3. As many as 16 PTEs in the two identified PTEGs 
are tested for a match with the VPN. Let q = 
minimum(5, 28-p). For a match to exist, the fol- 
lowing must be true: 


m PTE,=0 for the primary PTEG, 1 for the sec- 
ondary PTEG 

mw PTEy=1 

m PTEavpng.s4= VA 0:51 


a if p< 28, PTEAVPNs2:514q VA 52:51 +4 


If one or more matches are found, the search is 
successful; otherwise it fails. If more than one 
match is found, the matching entries must be 
identical in all defined fields with the exception of 
SW, H, AC, R, and C. If they are, one of the 
matching entries is used, for the translation, Data 
Address Compare, and the setting of the R and C 
bits. If they are not, the translation and Data 
Address Compare are undefined, as is the setting 
of the R and C bits in the matching entries, and 
the remainder of this section does not apply. 


If the Page Table search succeeds, the real address 
(RA) is formed by concatenating bits 0:61-p of the 
RPN from the matching PTE with bits 64- p:63 of the 
effective address (the byte offset). 


RA=RPNosi-p | EAga- p:63 


The N (No-execute) value used for the storage access 
is the result of ORing the N bit from the matching PTE 
with the N bit from the SLB entry that was used to 
translate the effective address. 


If the Page Table search fails, a page fault occurs. 
This is an Instruction Storage exception or a Data 
Storage exception, depending on whether the effec- 
tive address is for an instruction fetch or for a data 
access. 
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Programming Note 


To obtain the best performance, Page Table 
Entries should be allocated beginning with the 
first empty entry in the primary PTEG, or with the 
first empty entry in the secondary PTEG if the 
primary PTEG is full. 


Translation Lookaside Buffer 


Conceptually, the Page Table is searched by the 
address relocation hardware to translate every refer- 
ence. For performance reasons, the hardware usually 
keeps a Translation Lookaside Buffer (TLB) that holds 
PTEs that have recently been used. The TLB is 
searched prior to searching the Page Table. As a 
consequence, when software makes changes to the 
Page Table it must perform the appropriate TLB inval- 
idate operations to maintain the consistency of the 
TLB with the Page Table (see Section 4.11). 


Programming Notes 


1. Page Table entries may or may not be cached 
in a TLB. 

2. It is possible that the hardware implements 
more than one TLB, such as one for data and 
one for instructions. In this case the size and 
shape of the TLBs may differ, as may the 
values contained therein. 

3. Use the tlbie or tibia instruction to ensure 
that the TLB no longer contains a mapping for 
a particular virtual page. 


4.6 Data Address Compare 


The Data Address Compare mechanism provides a 
means of detecting load and store accesses to a 
virtual page. 


The Data Address Compare mechanism is controlled 


by the Address Compare Control Register (ACCR), 
and by a bit in each Page Table Entry (PTE,c). 


m DWDR 


0 62 63 
Bit Name Description 
62 DW Data Write Enable 


63 DR Data Read Enable 
All other fields are reserved. 


Figure 19. Address Compare Control Register 


A Data Address Compare match occurs for a Load or 
Store instruction if, for any byte accessed, 


m PTE,o=1 for the PTE that translates the virtual 
address, and 


m the instruction is a Store and ACCR5y= 1, or the 
instruction is a Load and ACCRpp=1. 


If the above conditions are satisfied, a match also 
occurs for dcbz, eciwx, and ecowx. For the purpose 
of determining whether a match occurs, eciwx is 
treated as a Load, and dcbz and ecowx are treated as 
Stores. 


If the above conditions are satisfied, it is undefined 
whether a match occurs in the following cases. 


m The instruction is Store Conditional but the store 
is not performed. 

m The instruction is a Load/Store String of zero 
length. 


The Cache Management instructions other than debz 
never cause a match. 


A Data Address Compare match causes a Data 
Storage exception (see Section 5.5.3, “Data Storage 
Interrupt” on page 56). If a match occurs, some or all 
of the bytes of the storage operand may have been 
accessed; however, if a Store, debz, or ecowx instruc- 
tion causes the match, the bytes of the storage 
operand that are in a virtual page with PTEAc= 1 are 
not altered. 


Programming Note 


The Data Address Compare mechanism does not 
apply to instruction fetches, or to data accesses in 
real addressing mode (MSRpp=0). 


If a Data Address Compare match occurs for a 
Load instruction for which any byte of the storage 
operand is in storage that is both Caching Inhib- 
ited and Guarded, or for an eciwx instruction, it 
may not be safe for software to restart the 
instruction. 


Engineering Note 


In the case of a Data Address Compare match, it 
is preferable not to access any bytes of the 
storage operand at or after the first matching 
byte. This makes the Data Address Compare 
mechanism more useful for debugging. 
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4.7 Storage Control Bits 


When address translation is enabled, each storage 
access is performed under the control of the Page 
Table Entry used to translate the effective address. 
Each Page Table Entry contains storage control bits 
that specify the presence or absence of the corre- 
sponding storage control attribute (see the section 
entitled “Storage Control Attributes” in Book Il, 
PowerPC AS Virtual Environment Architecture) for all 
accesses translated by the entry, as shown in 
Figure 20. The bits are called W, |, M, and G. 


Engineering Note 


Mechanisms other than processors (e.g., I/O 
devices) usually issue memory requests that are 
coherent. Such a mechanism may use the same 
coherence protocol that the processors use. In 
this case, the mechanism's use of the coherence 
protocol for storage that is shared with the 
processors may be independent of whether the 
processors access that storage as Memory Coher- 
ence Required. 


Bit Storage Control Attribute 


wi 0 — not Write Through Required 
1 — Write Through Required 


| 0 — not Caching Inhibited 
1- Caching Inhibited 


M2 0 — not Memory Coherence Required 
1 — Memory Coherence Required 


G O — not Guarded 
1 — Guarded 


1. Support for the 1 value of the W bit is optional. 
Implementations that do not support the 1 value 
treat the bit as reserved and assume its value 
to be 0. 

2. Support for the 0 value of the M bit is optional. 
Implementations that do not support the 0 value 
assume the value ofthe bitto be 1, and may 


either preserve the value of the bit or write it as 1. 


Figure 20. Storage control bits 


Instructions are not fetched from storage for which 
the G bit in the Page Table Entry is set to 1 (see 
Section 4.9, “Storage Protection” on page 40). 


Programming Note 


In a uniprocessor system in which only the 
processor has caches, correct coherent execution 
does not require the processor to access storage 
as Memory Coherence Required, and accessing 
storage as not Memory Coherence Required may 
give better performance. 


Engineering Note 


Because instruction storage need not be con- 
sistent with data storage, it is permissible for an 
implementation to ignore the M bit for instruction 
fetches. 


Treating instruction fetches as noncoherent may 
result in better performance in an implementation 
in which a coherent storage request has greater 
latency or overhead than a noncoherent storage 
request. However, care must be taken to avoid 
using a copy of a storage location that was 
fetched noncoherently (in response to an instruc- 
tion fetch) to satisfy a subsequent coherent data 
request caused by a Load, Store, or Cache Man- 
agement instruction. Also, care must be taken to 
ensure that the instruction sequence for instruc- 
tion modification that is shown in the section enti- 
tled “Instruction Cache Instruction” in Book II has 
the effects described there. 


In system designs, consideration must be given to 
whether instruction fetches are to be noncoherent 
and, if so, how this choice affects the implementa- 
tion of I/O subsystems and I/O caches. For 
example, if the processor ignores the M bit for 
instruction fetches, the system could ensure that 
instructions being copied into main storage have 
been flushed from any I/O cache before the 
program using them is restarted. 
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4.7.1 Storage Control Bit Restrictions 


All combinations of W, |, M, and G values are sup- 
ported except those for which both W and | are 1. 


Programming Note 


If an application program requests both the Write 
Through Required and the Caching Inhibited attri- 
butes for a given storage location, the operating 
system should set the | bit to 1 and the W bit to 0. 


The value of the | bit must be the same for all 
accesses to a given real page. 


The value of the W bit must be the same for all 
accesses to a given real page. 


4.7.2 Altering the Storage Control Bits 


When changing the value of the | bit for a given real 
page from 0 to 1, software must set the | bit to 1 and 
then flush all copies of locations in the page from the 
caches using dcbf and icbi before permitting any 
other accesses to the page. 


When changing the value of the W bit for a given real 
page from O to 1, software must ensure that no 
processor modifies any location in the page until after 
all copies of locations in the page that are considered 
to be modified in the data caches have been copied to 
main storage using dcbst or dcbf. 


When changing the value of the M bit for a given real 
page, software must ensure that all data caches are 
consistent with main storage. The actions required to 
do this to are system-dependent. 


Programming Note 


For example, when changing the M bit in some 
directory-based systems, software may be 


required to execute dcbf instructions on each 
processor to flush all storage locations accessed 
with the old M value before permitting the 
locations to be accessed with the new M value. 


Additional requirements for changing the storage 
control bits are given in Section 4.11.1, “Page Table 
Updates” on page 49 and in Chapter 7, “Synchroni- 
zation Requirements for Special Registers and for 
Lookaside Buffers” on page 71. 


4.8 Reference and Change 
Recording 


If address translation is enabled (MSRjp=1 or 
MSRpp=1), Reference (R) and Change (C) bits are 
maintained in the Page Table Entry that is used to 
translate the virtual address. If the storage operand 
of a Load or Store instruction crosses a virtual page 
boundary, the accesses to the components of the 
operand in each page are treated as separate and 
independent accesses to each of the pages for the 
purpose of setting the Reference and Change bits. 


Reference and Change bits are set by the processor 
as described below. Setting the bits need not be 
atomic with respect to performing the access that 
caused the bits to be updated. An attempt to access 
storage may cause one or more of the bits to be set 
(as described below) even if the access is not per- 
formed. The bits are updated in the Page Table Entry 
if the new value would otherwise be different from the 
old, as determined by examining either the Page 
Table Entry or any corresponding lookaside informa- 
tion maintained by the processor (e.g., in a TLB). 


Reference Bit 


The Reference bit is set to 1 if the corresponding 
access (load, store, or instruction fetch) is required 
by the sequential execution model and is per- 
formed. Otherwise the Reference bit may be set to 
1 if the corresponding access is attempted, either 
in-order or out-of-order, even if the attempt causes 
an exception. 


Change Bit 


The Change bit is set to 1 if a Store instruction is 
executed and the store is performed. Otherwise 
the Change bit may be set to 1 if a Store instruction 
is executed and the store is permitted by the 
storage protection mechanism and, if the Store 
instruction is executed out-of-order, the instruction 
would be required by the sequential execution 
model in the absence of the following kinds of inter- 
rupts: 


m system-caused interrupts (i.e., System Reset, 
Machine Check, External, and Decrementer 
interrupts) 


m Floating-Point Enabled Exception type Program 
interrupts when the processor is in an Impre- 
cise mode 
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r— Programming Note 


Even though the execution of a Store instruc- 
tion causes the Change bit to be set to 1, the 
store might not be performed or might be only 
partially performed in cases such as the fol- 
lowing. 


m A Store Conditional instruction (stwex. or 
stdcx.) is executed, but no store is per- 
formed. 


mA Store String Word Indexed instruction 
(stswx) is executed, but the length is zero. 


m The Store instruction causes a Data 
Storage exception (for which setting the 
Change bit is not prohibited). 


m The Store instruction causes an Alignment 
exception. 


m The Page Table Entry that translates the 
virtual address of the storage operand is 
altered such that the new contents of the 
Page Table Entry preclude performing the 
store (e.g., the PTE is made invalid, or the 
PP bits are changed). 


For example, when executing a Store 
instruction, the processor may search the 
Page Table for the purpose of setting the 
Change bit and then reexecute the instruc- 
tion. When reexecuting the instruction, the 
processor may search the Page Table a 
second time. If the Page Table Entry has 
meanwhile been altered, by a program exe- 
cuting on another processor, the second 
search may obtain the new contents, which 
may preclude the store. 


m A system-caused interrupt occurs before 
the store has been performed. 


Figure 21 on page 39 summarizes the rules for 
setting the Reference and Change bits. The table 
applies to each atomic storage reference. It should 
be read from the top down; the first line matching a 
given situation applies. For example, if stwex. fails 
due to both a storage protection violation and the lack 
of a reservation, the Change bit is not altered. 


In the figure, the “Load-type” instructions are the 
Load instructions described in Books | and Il, eciwx, 


and the Cache Management instructions that are 
treated as Loads. The “Store-type” instructions are 
the Store instructions described in Books | and Il, 
ecowx, and the Cache Management instructions that 
are treated as Stores. The “ordinary” Load and Store 
instructions are those described in Books | and Il. 
“set” means “set to 1”. 


Engineering Note 


Any implementation-specific interrupt used to 
emulate instructions in software can be handled in 
a manner similar to a system-caused interrupt. 
That is, if the hardware can determine that the 
instruction to be emulated will not cause a precise 
architected interrupt then the Change bit can be 
set out-of-order past the instruction to be emu- 
lated under the same conditions as this bit can be 
set past a potential system-caused interrupt. 


When the hardware updates the Reference and 
Change bits in the Page Table Entry, the accesses are 
performed as described in Section 4.2.6, “Real 
Addressing Mode” on page 26. The accesses may be 
performed using operations equivalent to a store to a 
byte, halfword, word, or doubleword, and are not nec- 
essarily performed as an atomic read/modify/write of 
the affected bytes. 


These Reference and Change bit updates are not nec- 
essarily immediately visible to software. Executing a 
sync instruction ensures that all Reference and 
Change bit updates associated with address trans- 
lations that were performed, by the processor exe- 
cuting the sync instruction, before the sync instruction 
is executed will be performed with respect to that 
processor before the sync instruction's memory 
barrier is created. There are additional requirements 
for synchronizing Reference and Change bit updates 
in multiprocessor systems; see Section 4.11.1, “Page 
Table Updates” on page 49. 


Programming Note 


Because the sync instruction is execution synchro- 
nizing, the set of Reference and Change bit 
updates that are performed with respect to the 
processor executing the sync instruction before 
the memory barrier is created includes all Refer- 
ence and Change bit updates associated with 
instructions preceding the sync instruction. 
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Status of Access 


Storage protection violation 
Out-of-order I-fetch or Load-type inst'n 
Out-of-order Store-type inst'n 
Would be required by the sequential 
execution model in the absence of 
system-caused or imprecise 
interrupts? 
All other cases 
In-order Load-type or Store-type inst'n, 
access not performed 
Loaa-type inst'n 
Store-type inst'n 
Other in-order access 
I-fetch 
Ordinary Load, eciwx 
Other ordinary Store, ecowx, dcbz 
icbi, dcbt, dcbtst, dcbst, dcbf 


“Acc” means that it is acceptable to set the bit. 

1 It is preferable not to set the bit. 

2 If Cis set, R is also set unless it is already set. 

3 For Floating-Point Enabled Exception type Program 
interrupts, “imprecise” refers to the exception mode 
controlled by MSR¢eo re+: 


Figure 21. Setting the Reference and Change bits 


If software refers to a Page Table Entry when Engineering Note 
MSRpp= 1, Reference and Change bits in the associ- 
ated Page Table Entries are set as for ordinary loads 
and stores. See Section 4.11.1 for the rules software 
must follow when updating Reference and Change 


Since most TLB reloads do not require altering 
the Reference or Change bit in the Page Table 
Entry (PTE), it is suggested that on a TLB miss the 
search for the PTE be done without fetching the 


pits; PTEs for exclusive access. This will reduce cache 

Engineering Note thrashing due to TLB reloads. It is assumed that 

a nonexclusive request for a PTE will be returned 

If the hardware updates a Reference or Change with exclusive access if no other processor has a 
bit in the Page Table Entry without using an copy. 


atomic read/modify/write operation, care must be 
taken to avoid overwriting an update to any of 
these bits by another processor. Thus the datum 
written to the Page Table Entry must not contain a 
O value for any of these bits. 


Subject to the preceding requirement, when the 
hardware updates the Reference or Change bit in 
the Page Table Entry it is permissible to store the 
corresponding byte, halfword, word, or 
doubleword, with the relevant subset of these two 
bits updated, from any lookaside information (e.g., 
TLB) maintained by the processor. 
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4.9 Storage Protection 


The storage protection mechanism provides a means 
for selectively granting instruction fetch access, 
granting read access, granting read/write access, and 
prohibiting access to areas of storage based on a 
number of control criteria. 


The operation of the protection mechanism depends 
on whether address translation is enabled (MSRjp= 1 
or MSRpp=1, as appropriate for the access) or disa- 
bled (MSRip=0 or MSRpa=0, as appropriate for the 
access). 


If an instruction fetch is not permitted by the pro- 
tection mechanism, an Instruction Storage exception 
is generated. If a data access is not permitted by the 
protection mechanism, a Data Storage exception is 
generated. (See Section 4.2.1, “Storage Exceptions” 
on page 22.) 


When address translation is enabled, a protection 
domain is a range of unmapped effective addresses, a 
virtual page, or a segment. When address translation 
is disabled and LPES=1 there are two protection 
domains: the set of effective addresses that are less 
than the value specified by the RMLR, and all other 
effective addresses. When address translation is dis- 
abled and LPES=0 the entire effective address space 
comprises a single protection domain. A protection 
boundary is a boundary between protection domains. 


4.9.1 Storage Protection, Address 
Translation Enabled 


When address translation is enabled, the protection 
mechanism is controlled by the following. 


m MSRpp, which distinguishes between supervisor 
(privileged) state and problem state 
m K, and Kp, the supervisor (privileged) state and 
problem state storage key bits in the SLB entry 
used to translate the effective address 
m PP, page protection bits in the Page Table Entry 
used to translate the effective address 
m For instruction fetches only: 
— the N (No-execute) value used for the access 
(see Section 4.5.3) 
— PTEg, the G (Guarded) bit in the Page Table 
Entry used to translate the effective address 


Using the above values, the following rules are 
applied. 


1. For an instruction fetch, the access is not per- 
mitted if the N value is 1 or if PTEg=1. 


2. For any access except an instruction fetch that is 
not permitted by rule 1, a “Key” value is com- 
puted using the following formula: 


Key 4 (Kp 4 MSRpp) (K, & MSRpp) 
p PR Ss PR 


Using the computed Key, Figure 22 is applied. 
An instruction fetch is permitted for any entry in 
the figure except “no access”. A load is per- 
mitted for any entry except “no access”. A store 
is permitted only for entries with “read/write”. 


y Eis Access Authority 


read/write 
read/write 
read/write 


read only 


no access 
read only 
read/write 
read only 


Figure 22. PP bit protection states, address trans- 
lation enabled 


4.9.2 Storage Protection, Address 
Translation Disabled 


When address translation is disabled, the protection 
mechanism is controlled by the following (see Section 
1.7, “Logical Partitioning (LPAR)” on page 4 and 
Section 4.2.6, “Real Addressing Mode” on page 26). 


m LPES, which distinguishes between the two modes 
of using the LPAR facility 

m MSRyy which distinguishes between hypervisor 
state and other privilege states 

= RMLR, which specifies the real mode limit value 


Using the above values, Figure 23 is applied. The 
access is permitted for any entry in the figure except 
“no access”. 


| LPES HV | Access Authority 

| 0 0 no access 
0 1 read/write 
1 0 read/write or no access? 
1 1 read/write 


1. If the effective address for the access is less 
than the value specified by the RMLR the 
access authority is read/write; otherwise the 
access is not permitted. 


Figure 23. Protection states, address translation disa- 
bled 


Programming Note 


The comparison described in note 1 in Figure 23 
ignores bits 0:1 of the effective address and may 
ignore bits 2:63- m; see Section 4.2.6. 
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4.10 Storage Control Instructions 


4.10.1 Cache Management 
Instructions 


This section describes aspects of cache management 
that are relevant only to operating systems. 


For a dcbz instruction that causes the target block to 
be newly established in the data cache without being 
fetched from main storage, the processor need not 
verify that the associated real address is valid. The 
existence of a data cache block that is associated with 
an invalid real address (see Section 4.2.8) can cause 
a delayed Machine Check interrupt or a delayed 
Checkstop. 


Each implementation provides an efficient means by 
which software can ensure that all blocks that are 
considered to be modified in the data cache have 
been copied to main storage before the processor 
enters any power conserving mode in which data 
cache contents are not maintained. The means are 
described in the Book IV, PowerPC AS Implementa- 
tion Features document for the implementation. 


4.10.2 Lookaside Buffer Management 


All implementations have a Segment Lookaside Buffer 
(SLB), and provide the SLB Management instructions 
described in Section 4.10.2.1. 


For performance reasons, most implementations have 
a Translation Lookaside Buffer (TLB), which is a cache 
of recently used Page Table Entries (PTEs). The TLB 
is not necessarily kept consistent with the Page Table 
in main storage. When software alters the contents of 
a PTE, it must also invalidate all corresponding TLB 
entries. 


Each implementation that has a TLB provides a 
means by which software can do the following. 


m Invalidate the TLB entry that translates a given 
effective address 


m Invalidate all TLB entries 


An implementation may provide one or more of the 
TLB Management instructions described in Section 
4.10.2.2 in order to satisfy requirements in the pre- 
ceding list. Alternatively, an algorithm may be given 
that performs one of the functions listed above (a 
loop invalidating individual TLB entries may be used 
to invalidate the entire TLB, for example), or different 
instructions may be provided. Such algorithms or 


instructions are described in Book IV, PowerPC AS 
Implementation Features. Because most implementa- 
tions have a TLB and also provide instructions similar 
or identical to the TLB Management instructions 
described in Section 4.10.2.2, other sections of the 
Books assume that the TLB exists and that the 
instructions described in Section 4.10.2.2 are pro- 
vided. 


An implementation that does not have a TLB treats 
the corresponding instructions (tlbie, tibia, and 
tlbsync) either as no-ops or as illegal instructions. 


Programming Note 


Because the presence, absence, and exact 
semantics of the TLB Management instructions 
are implementation-dependent, it is recommended 
that system software “encapsulate” uses of these 
instructions into subroutines to minimize the 
impact of moving from one implementation to 
another. 


Programming Note 


The function of all the instructions described in 
Sections 4.10.2.1 and 4.10.2.2 is independent of 
whether address translation is enabled or disa- 
bled. 


For a discussion of software synchronization 
requirements when invalidating SLB and TLB 
entries, see Chapter 7, “Synchronization Require- 
ments for Special Registers and for Lookaside 
Buffers” on page 71. 


Engineering Note 


It is possible for the hardware to implement more 
than one TLB, such as one for data and one for 
instructions. If this approach is taken, the 
requirement for an instruction that invalidates a 
TLB entry may be satisfied by a single instruction 
for all TLBs or by separate instructions for each 
TLB. 


Engineering Note 


Primary opcode 31, extended opcode 308, can be 
used for a privileged implementation-specific TLB 
invalidation function. 


Primary opcode 31, extended opcodes 978 and 
1010, can be used for a privileged implementa- 
tion-specific TLB reload function for data and 
instructions respectively. 
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4.10.2.1 SLB Management Instructions 


Programming Note 


Accesses to a given SLB entry caused by the 
instructions described in this section obey the 
sequential execution model with respect to the 
contents of the entry and with respect to data 
dependencies on those contents. That is, if an 
instruction sequence contains two or more of 
these instructions, when the sequence has com- 
pleted, the final state of the SLB entry and of 
General Purpose Registers is as if the instructions 
had been executed in program order. 


However, software synchronization is required in 
order to ensure that any alterations of the entry 
take effect correctly with respect to address trans- 
lation; see Chapter 7. 
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SLB Invalidate Entry X-form 


slbie RB 


31 WII II RB 434 / 
0 6 11 16 21 31 


esid + (RB) 9.35 
class + (RB) 3 
if class = SLBEç for SLB entry that translates 
or most recently translated esid 
then for SLB entry (if any) that translates esid 
SLBEy € 0 
all other fields of SLBE € undefined 
else translation of esid € undefined 


Let the Effective Segment ID (ESID) be (RB)o.35. Let 
the class be (RB)gg. The class value must be the 
same as the Class value in the SLB entry that trans- 
lates the ESID, or the Class value that was in the SLB 
entry that most recently translated the ESID if the 
translation is no longer in the SLB; if the class value 
is not the same, the results of translating effective 
addresses for which EAç.35=ESID are undefined, and 
the next paragraph need not apply. 


If the SLB contains an entry that translates the speci- 
fied ESID, the V bit in that entry is set to O, making 
the entry invalid, and the remaining fields of the entry 
are set to undefined values. 


(RB)37-63 must be zeroes. 

If this instruction is executed in 32-bit mode, (RB)9.31 
must be zeros (i.e., the ESID must be in the range 
0-15). 


This instruction is privileged. 


Special Registers Altered: 
None 


Programming Note 


The reason the class value specified by s/bie must 
be the same as the Class value that is or was in 
the relevant SLB entry is that the processor may 
use these values to optimize invalidation of imple- 
mentation-specific lookaside information used in 
address translation. If the value specified by sibie 
differs from the value that is or was in the rele- 
vant SLB entry, these optimizations may produce 
incorrect results. (An example of implementa- 
tion-specific address translation lookaside infor- 
mation is the set of recently used translations of 
effective addresses to real addresses that some 
processors maintain in an Effective to Real 
Address Translation (ERAT) lookaside buffer.) 


The recommended use of the Class field is to 
classify SLB entries according to the expected 
longevity of the translations they contain, or a 
similar property such as whether the translations 
are used by all programs or only by a single 
program. If this is done and the processor invali- 
dates certain implementation-specific lookaside 
information based only on the specified class 
value, an sibie instruction that invalidates a short- 
lived translation will preserve such lookaside 
information for long-lived translations. 


If the optional “Bridge” facility is implemented 
(see Section 9.1), the Move To Segment Register 
instructions create SLB entries in which the Class 
value is 0. 


Programming Note 


The only SLB entry that is invalidated is the entry 
(if any) that translates the specified ESID. 


sibie does not affect SLBs on other processors. 


Engineering Note 


(RB)37.63 must be ignored by the processor. 


Preserving the contents of the SLB entry (other 
than the V bit) when an sibie instruction is exe- 
cuted, and returning the contents of the SLB entry 
when an slbmfev or slbmfee instruction is exe- 
cuted that specifies an invalid SLB entry, facili- 
tates the debugging of software. 
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Engineering Note 


An example of how the class value can be used to 
optimize invalidation of implementation-specific 
address translation lookaside information is as 
follows. (The class value has no architecturally 
defined use, nor does the Class field of SLB 
entries.) 


On implementations that have an Effective to Real 
Address Translation lookaside buffer (ERAT), the 
class value can be used to select the ERAT 
entries to invalidate when an sibie instruction is 
executed. (Invalidating only ERAT entries in 
which the Class value is equal to the specified 
class value is likely to provide better performance 
than invalidating all ERAT entries.) If ERAT 
entries are used to translate effective addresses 
in real addressing mode, those entries can be 
treated as if they contain a Class value that lies 
outside the range supported by the SLB entry, so 
that slbie does not invalidate them. 


Architecture Note 


Bits 11:15 of the sibie instruction (ordinarily the 
position of an RA field) must be zero. This pro- 
vides implementations the option of using 
(RA|0)+(RB) address arithmetic for slbie. 


The requirement that RB37.g3 contain zeros and be 
ignored by the processor permits the Class field 
of the SLB entry and the class value supplied by 
slbie to be enlarged in the future if that proves 
desirable. 


The requirement that RBo.3; contain zeros in 
32-bit mode permits normal EA computation (in 
which the high-order 32 bits of the result are 
treated as zeros in 32-bit mode but not in 64-bit 
mode) to be used for sibie. 


SLB Invalidate All X-form 


slbia 


31 WI II WI 498 / 
0 6 11 16 21 31 


for each SLB entry except SLB entry 0 
SLBEy + 0 
all other fields of SLBE 4 undefined 


For all SLB entries except SLB entry 0, the V bit in 
the entry is set to 0, making the entry invalid, and the 
remaining fields of the entry are set to undefined 
values. SLB entry 0 is not altered. 

This instruction is privileged. 


Special Registers Altered: 
None 


Programming Note —————————————__ 
sibia does not affect SLBs on other processors. 


Programming Note 


If slbia is executed when instruction address 
translation is enabled (MSR;p=1), software can 
ensure that attempting to fetch the instruction fol- 
lowing the slbia does not cause an Instruction 
Segment interrupt by placing the sibia and the 
subsequent instruction in the effective segment 
mapped by SLB entry 0. (The preceding assumes 
that no other interrupts occur between executing 
the slbia and executing the subsequent instruc- 
tion.) 
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SLB Move To Entry X-form 


slbmte RS,RB 


31 RS VII RB 402 / 
0 6 11 16 21 31 


The SLB entry specified by bits 52:63 of register RB is 
loaded from register RS and from the remainder of 


register RB. The contents of these registers are 
interpreted as shown in Figure 24. 
RS 
| VSID [KK NLC| Os 
0 52 56 63 
RB 
ESID V Os index 
0 35 37 52 63 
RS9:54 VSID 
RSs K; 
RS53 Kp 
RS54 N 
RSss L 
RSsg C 
RSs7.63 must be 0b000 0000 
RBo.35 ESID 
RBas V 
RBs751 must be 0b000 | 0x000 
RBs263 index, which selects the SLB entry 


Figure 24. GPR contents for slbmte 


On implementations that support a virtual address 
size of only n bits, n< 80, (RS)o:793-n must be zeros. 


High-order bits of (RB)s52:3 that correspond to SLB 
entries beyond the size of the SLB provided by the 
implementation must be zeros. 


If this instruction is executed in 32-bit mode, (RB)o.3; 
must be zeros (i.e., the ESID must be in the range 
0-15). 


This instruction cannot be used to invalidate an SLB 
entry. 


This instruction is privileged. 


Special Registers Altered: 
None 


Programming Note 


The reason slbmte cannot be used to invalidate 
an SLB entry is that it does not necessarily affect 
implementation-specific address translation look- 
aside information. sibie (or slbia) must be used 
for this purpose. 


Engineering Note 
(RS)s7:63 must be ignored by the processor. 


Architecture Note 


The requirement that R$57.63 contain zeros and be 
ignored by the processor permits the Class field 
of the SLB entry to be enlarged in the future if 
that proves desirable. 


The requirement that RBo.3; contain zeros in 
32-bit mode permits normal EA computation (in 
which the high-order 32 bits of the result are 
treated as zeros in 32-bit mode but not in 64-bit 
mode) to be used for s/bmte. 
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SLB Move From Entry VSID X-form 


slbmfev RT,RB 


31 RT WI RB 851 / 
0 6 11 16 21 31 


If the SLB entry specified by bits 52:63 of register RB 
is valid (V=1), the contents of the VSID, Kg, Kos N, L, 
and C fields of the entry are placed into register RT. 
The contents of these registers are interpreted as 
shown in Figure 25. 


RT 


| VSID [KsKpNLC| Os 


RB 


o 
q 
N 
a 
a 
Q 
ow 


S 
p 


v 
K 
RTsg K 
N 
L 
o; 
set to 0b000_0000 


must be 0x0_0000_0000_0000 
index, which selects the SLB entry 


Figure 25. GPR contents for slbmfev 


On implementations that support a virtual address 
size of only n bits, n< 80, RTp.79_, are set to zeros. 


If the SLB entry specified by bits 52:63 of register RB 
is invalid (V=0), the contents of register RT are unde- 
fined. 


High-order bits of (RB)so.63 that correspond to SLB 
entries beyond the size of the SLB provided by the 
implementation must be zeros. 


This instruction is privileged. 


Special Registers Altered: 
None 


Architecture Note 


The requirement that RTs7ez be set to zeros 
permits the Class field of the SLB entry to be 
enlarged in the future if that proves desirable. 
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SLB Move From Entry ESID X-form 


slbmfee RT,RB 


31 RT WI RB 915 / 
0 6 11 16 21 31 


If the SLB entry specified by bits 52:63 of register RB 
is valid (V=1), the contents of the ESID and V fields 
of the entry are placed into register RT. The contents 
of these registers are interpreted as shown in Figure 
26. 


RT 

ESID V Os 
0 35 37 63 
RB 

Os index 

0 52 63 
RTo:35 ESID 
RT 36 V 
RTs763 set to 0b000 | 0x00_0000 
RBo:51 must be 0x0 0000 0000 0000 


RBs263 index, which selects the SLB entry 
Figure 26. GPR contents for slbmfee 
If the SLB entry specified by bits 52:63 of register RB 


is invalid (V=0), RTsg is set to O and the contents of 
RTo:35 and RTs7.63 are undefined. 


High-order bits of (RB)se3 that correspond to SLB 
entries beyond the size of the SLB provided by the 
implementation must be zeros. 


This instruction is privileged. 


Special Registers Altered: 
None 
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4.10.2.2 TLB Management Instructions (Optional) 


TLB Invalidate Entry X-form 


tibie RB,L 
[POWER mnemonic: tlbi] 


31 Hf (L 4 RB 306 / 


if L=0 
then pg size ¢ 4 KB 
else pg size € large page size 
p + log base 2(pg size) 
for each TLB entry 
if (entry_VPN39.79 5 = (RB) 16:63-p) & 
(entry_pg_size = pg_size) 
then TLB entry € invalid 


The contents of (RB)9.15 must be 0x0000. If the L field 
of the instruction is 1 let the page size be large; oth- 
erwise let the page size be 4 KB. 


All TLB entries that have all of the following proper- 
ties are made invalid on all processors. 


m The entry translates a virtual address for which 
VPNao:79-p is equal to (RB)16:63-p- 

m The page size of the entry matches the page size 
specified by the L field of the instruction. 


Additional TLB entries may also be made invalid on 
any processor. 


MSRs must be 1 when this instruction is executed; 
otherwise the results are undefined. 


The operation performed by this instruction is ordered 
by the eieio (or sync) instruction with respect to a 
subsequent tlbsync instruction executed by the 
processor executing the tlbie instruction. The oper- 
ations caused by tlbie and tlbsync are ordered by 
eieio as a third set of operations, which is inde- 
pendent of the other two sets that eieio orders. 


This instruction is privileged, and can be executed 
only in hypervisor state. If it is executed in privileged 
but non-hypervisor state either a Privileged Instruc- 
tion type Program interrupt occurs or the results are 
boundedly undefined. 


This instruction is optional. 


See Section 4.11.1, “Page Table Updates” on page 49 
for a description of other requirements associated 
with the use of this instruction. 


Special Registers Altered: 
None 


Architecture Note 


Bits 11:15 of the tibie instruction (ordinarily the 
position of an RA field) must be zero. This pro- 
vides implementations the option of using 
(RA|0)+(RB) address arithmetic for tlbie. 


The requirement that tlbie be executed only in 
64-bit mode permits normal EA computation (in 
which the high-order 32 bits of the result are 
treated as zeros in 32-bit mode but not in 64-bit 
mode) to be used for tlbie. (If tlbie were executed 
in 32-bit mode, on an implementation that does 
normal EA computation for tlbie the high-order 16 
bits of the specified VPN bits would be treated as 
zeros.) 


The requirement that tibie (and tlbsync) be exe- 
cuted only in hypervisor state reduces implemen- 
tation complexity, by avoiding the need to ensure 
that violation of the requirements described in 
Section 4.11.1 by non-hypervisor software does 
not cause a Checkstop or other significant 
system-wide event. 


Architecture Note 


Cumulative ordering is moot for the memory 
barrier created by eieio for tlbie and tlbsync, 
because at most one processor should execute 
these instructions at a time (see Section 4.11.1). 


Engineering Note 


Causing a Privileged Instruction type Program 
interrupt if tlbie or tlbsync is executed in privi- 
leged but non-hypervisor state facilitates the 
debugging of software. 
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TLB Invalidate All X-form 


tibia 


31 HH HH HH 370 / 
0 6 11 16 21 31 


all TLB entries € invalid 


All TLB entries are made invalid on the processor 
executing the tlbia instruction. 


This instruction is privileged. 
This instruction is optional. 


Special Registers Altered: 
None 


Programming Note 
tibia does not affect TLBs on other processors. 


TLB Synchronize X-form 


tlbsync 


31 III II III 566 / 


S 
o 
= 
an 
D 
N 
y 
w 
= 


The tlbsync instruction provides an ordering function 
for the effects of all tlbie instructions executed by the 
processor executing the tlbsync instruction, with 
respect to the memory barrier created by a subse- 
quent sync instruction executed by the same 
processor. Executing a tlbsync instruction ensures 
that all of the following will occur. 


m All TLB invalidations caused by tlbie instructions 
preceding the tlbsync instruction will have com- 
pleted on any other processor before any data 
accesses caused by instructions following the 
sync instruction are performed with respect to 
that processor. 


m All storage accesses by other processors for 
which the address was translated using the trans- 
lations being invalidated, and all Reference and 
Change bit updates associated with address 
translations that were performed by other 
processors using the translations being invali- 
dated, will have been performed with respect to 
the processor executing the sync instruction, to 
the extent required by the associated Memory 
Coherence Required attributes, before the sync 
instruction's memory barrier is created. 


The operation performed by this instruction is ordered 
by the eieio (or sync) instruction with respect to pre- 
ceding tlbie instructions executed by the processor 
executing the tlbsync instruction. The operations 
caused by tlbie and tlbsync are ordered by eieio as a 
third set of operations, which is independent of the 
other two sets that eieio orders. 


The tlbsync instruction may complete before oper- 
ations caused by tlbie instructions preceding the 
tlbsync instruction have been performed. 


This instruction is privileged, and can be executed 
only in hypervisor state. If it is executed in privileged 
but non-hypervisor state either a Privileged Instruc- 
tion type Program interrupt occurs or the results are 
boundedly undefined. 


This instruction is optional. 

See Section 4.11.1, “Page Table Updates” on page 49 
for a description of other requirements associated 
with the use of this instruction. 


Special Registers Altered: 
None 


Architecture Note 


See the first Architecture Note in the tlbie instruc- 
tion description for an explanation of why tlbsync 
can be executed only in hypervisor state. 
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4.11 Page Table Update 
Synchronization Requirements 


This section describes rules that software should 
follow when updating the Page Table, and includes 
suggested sequences of operations for some repre- 
sentative cases. 


In the sequences of operations shown in the following 
subsections, any alteration of a Page Table Entry 
(PTE) that corresponds to a single line in the 
sequence is assumed to be done using a Store 
instruction for which the access is atomic. Appro- 
priate modifications must be made to these 
sequences if this assumption is not satisfied (e.g., if a 
store doubleword operation is done using two Store 
Word instructions). 


Sequences that use the tibie instruction may require 
a context synchronizing operation before and/or after 
the sequence; see Chapter 7, “Synchronization 
Requirements for Special Registers and for Lookaside 
Buffers” on page 71. Similarly, sequences that add a 
PTE require a context synchronizing operation after 
the sequence if the new entry is needed in order to 
translate the effective addresses of subsequent 
instructions. 


Page Table Entries must not be changed in a manner 
that causes an implicit branch. 


4.11.1 Page Table Updates 


TLBs are non-coherent caches of the HTAB. TLB 
entries must be invalidated explicitly with one of the 
TLB Invalidate instructions. 


Unsynchronized lookups in the HTAB continue even 
while it is being modified. Any processor, including 
the processor modifying the HTAB, may look in the 
HTAB at any time in an attempt to reload a TLB entry. 
When altering a PTE, software must ensure that the 
PTE's Valid bit is O if the PTE is inconsistent (e.g., if 
the RPN field is not correct for the current AVPN 
field). 


Updates of Reference and Change bits by the 
processor are not synchronized with the accesses that 
cause the updates. When modifying the low-order 
half of a PTE, software must take care to avoid over- 
writing a processor update of these bits and to avoid 
having the value written by a Store instruction over- 
written by a processor update. The processor does 
not alter any other fields of the PTE. 


In a multiprocessor system, when one or more tibie 
instructions have been executed by a processor in a 


given partition, the following sequence of instructions 
must be executed by that processor before a tlbie or 
tlbsync instruction is executed by another processor 
in that partition. 

eieio 

tlbsync 

sync 


Other instructions may be interleaved with this 
sequence of instructions, but these instructions must 
appear in the order shown. 


Programming Note 


The eieio instruction prevents the reordering of 
tlbie instructions previously executed by the 
processor with respect to the subsequent tlbsync 
instruction. The tlbsync instruction and the subse- 
quent sync instruction together ensure that all 
storage accesses for which the address was 
translated using the translations being invali- 
dated, and all Reference and Change bit updates 
associated with address translations that were 
performed using the translations being invali- 
dated, will be performed with respect to any 
processor or mechanism, to the extent required 
by the associated Memory Coherence Required 
attributes, before any data accesses caused by 
instructions following the sync instruction are per- 
formed with respect to that processor or mech- 
anism. 


Similarly, when a tlbsync instruction has been exe- 
cuted by a processor in a given partition, a sync 
instruction must be executed by that processor before 
a tlbie or tlbsync instruction is executed by another 
processor in that partition. 


The sequences of operations shown in the following 
subsections assume a multiprocessor environment. 
In a uniprocessor environment the tlbsync can be 
omitted, as can the eieio that separates the tlbie from 
the tlbsync. 


4.11.1.1 Adding a Page Table Entry 


This is the simplest Page Table case. The Valid bit of 
the old entry is assumed to be 0. The following 
sequence can be used to create a PTE, maintain a 
consistent state, and ensure that a subsequent refer- 
ence to the virtual address translated by the new 
entry will use the correct real address and associated 
attributes. 


PTERPN,AC,R,C,WIMG,N,PP t new values 


eieio /* order lst update before 2nd */ 
PTE AVPN,SW,H.V ¢ new values (V=1) 
sync /* order updates before next 

storage accesses xj; 
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4.11.1.2 Modifying a Page Table Entry 


General Case 


If a valid entry is to be modified and the translation 
instantiated by the entry being modified is to be inval- 
idated, the following sequence can be used to modify 
the PTE, maintain a consistent state, ensure that the 
translation instantiated by the old entry is no longer 
available, and ensure that a subsequent reference to 
the virtual address translated by the new entry will 
use the correct real address and associated attri- 
butes. (The sequence is equivalent to deleting the 
PTE and then adding a new one; see Sections 4.11.1.3 
and 4.11.1.1.) 


PTEy € 0 /* (other fields don't matter) x 
sync /* order update before tlbie and 

before next storage accesses */ 
tlbie (old_VPNg9.79 5,old_L) /* invalidate old 


translation x] 
eieio /* order tlbie before tlbsync x 
tlbsync /* order tlbie before sync */ 
sync /* order tlbie and tlbsync before 
next storage accesses xy 
PTERPN,AC,R,CWIMGN,PP € new values 
eieio /* order 2nd update before 3rd X 
PTE AVPN,SW,H,V ¢ new values (V=1) 
sync /* order 2nd and 3rd updates before 
next storage accesses x7 


Resetting the Reference Bit 


If the only change being made to a valid entry is to 
set the Reference bit to 0, a simpler sequence suf- 
fices because the Reference bit need not be main- 
tained exactly. 


oldR + PTER /* get old R */ 
if oldR = 1 then 
PTER « 0 /* store byte (R=0, other bits 
unchanged) */ 
tlbie (VPN39.79_p,L) /* invalidate entry */ 
eieio Pe order tlbie before tlbsync xj: 
tlbsync /* order tlbie before sync ej 
sync /* order tlbie, tlbsync, and 
update before next storage 
accesses ay 


Modifying the Virtual Address 


If the virtual address translated by a valid PTE is to 
be modified and the new virtual address hashes to 
the same two PTEGs as does the old virtual address, 
the following sequence can be used to modify the 
PTE, maintain a consistent state, ensure that the 
translation instantiated by the old entry is no longer 
available, and ensure that a subsequent reference to 
the virtual address translated by the new entry will 
use the correct real address and associated attri- 
butes. 


PTE AVPN,SW,H.V ¢ new values (V=1) 

sync /* order update before tlbie and 
before next storage accesses */ 

tlbie (old_VPNg9.79 5,0ld_L) /* invalidate old 


translation */ 
eieio /* order tlbie before tlbsync */ 
tlbsync /* order tlbie before sync */ 
sync /* order tlbie and tlbsync before 

next storage accesses s 


To modify the AC, N, or PP bits without overwriting a 
Reference or Change bit update being performed by 
the processor or by some other processor in the 
system, a sequence similar to that shown above can 
be used except that the first line would be replaced 
by a sync instruction followed by a loop containing a 
Idarx/stdex. pair that emulates an atomic “Compare 
and Swap” of the low-order doubleword of the PTE. 
(See the section entitled “Atomic Update Primitives” 
in Book II, PowerPC AS Virtual Environment Architec- 
ture for a description of “Compare and Swap”.) 


4.11.1.3 Deleting a Page Table Entry 


The following sequence can be used to ensure that 
the translation instantiated by an existing entry is no 
longer available. 


PTEy ¢ O /* (other fields don't matter) */ 
sync /* order update before tlbie and 

before next storage accesses */ 
tlbie (old_VPN39.79 pr OLA L) /* invalidate old 


translation a) 
eieio /* order tlbie before tlbsync */ 
tlbsync /* order tlbie before sync t 
sync /* order tlbie and tlbsync before 

next storage accesses */ 
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Chapter 5. Interrupts 


51 Overview ................ 51 
5.2 Interrupt Synchronization ...... 51 
5.3 Interrupt Classes ........... 52 
5.3.1 Precise Interrupt .......... 52 


5.3.2 Imprecise Interrupt ......... 52 


5.4 Interrupt Processing ......... 53 
5.5 Interrupt Definitions ......... 54 
5.5.1 System Reset Interrupt ...... 55 
5.5.2 Machine Check Interrupt ..... 55 
5.5.3 Data Storage Interrupt ...... 56 
5.5.4 Data Segment Interrupt ...... 57 
5.5.5 Instruction Storage Interrupt ... 58 
5.5.6 Instruction Segment Interrupt .. 58 
5.5.7 External Interrupt .......... 59 


5.1 Overview 


The PowerPC AS Architecture provides an interrupt 
mechanism to allow the processor to change state as 
a result of external signals, errors, or unusual condi- 
tions arising in the execution of instructions. 


System Reset and Machine Check interrupts are not 
ordered. All other interrupts are ordered such that 
only one interrupt is reported, and when it is proc- 
essed (taken) no program state is lost. Since 
Save/Restore Registers SRRO and SRR1 are serially 
reusable resources used by most interrupts, program 
state may be lost when an unordered interrupt is 
taken. 


5.5.8 Alignment Interrupt ........ 59 
5.5.9 Program Interrupt ......... 60 
5.5.10 Floating-Point Unavailable 

Interrupt 2.2... ee 62 
5.5.11 Decrementer Interrupt ...... 62 
5.5.12 System Call Interrupt ....... 62 
5.5.13 Trace Interrupt .......... 63 
5.5.14 Performance Monitor Interrupt 
(Optional) ................._ 63 
5.6 Partially Executed Instructions ... 63 
5.7 Exception Ordering ......... 64 
5.7.1 Unordered Exceptions ....... 64 
5.7.2 Ordered Exceptions ........ 64 
5.8 Interrupt Priorities .......... 65 


5.2 Interrupt Synchronization 


When an interrupt occurs, SRRO is set to point to an 
instruction such that all preceding instructions have 
completed execution, no subsequent instruction has 
begun execution, and the instruction addressed by 
SRRO may or may not have completed execution, 
depending on the interrupt type. 


With the exception of System Reset and Machine 
Check interrupts, all interrupts are context synchro- 
nizing as defined in Section 1.6.1, “Context 
Synchronization” on page 3. System Reset and 
Machine Check interrupts are context synchronizing if 
they are recoverable (i.e., if bit 62 of SRR1 is set to 1 
by the interrupt). If a System Reset or Machine 
Check interrupt is not recoverable (i.e., if bit 62 of 
SRR1 is set to O by the interrupt), it acts like a 
context synchronizing operation with respect to sub- 
sequent instructions. That is, a non-recoverable 
System Reset or Machine Check interrupt need not 
satisfy items 1 through 3 of Section 1.6.1, but does 
satisfy items 4 and 5. 
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5.3 Interrupt Classes 


Interrupts are classified by whether they are directly 
caused by the execution of an instruction or are 
caused by some other system exception. Those that 
are “system-caused” are: 


System Reset 
Machine Check 
External 
Decrementer 


External and Decrementer are maskable interrupts. 
While MSREç=0, the interrupt mechanism ignores the 
exceptions that generate these interrupts. Therefore, 
software may delay the generation of these interrupts 
by setting MSRee=0 or by failing to set MSRgg 1 
after processing an interrupt. When any interrupt is 
taken, MSR is set to O by the interrupt mechanism, 
delaying the recognition of any further exceptions 
causing these interrupts. 


System Reset and Machine Check exceptions are not 
maskable. These exceptions will be recognized 
regardless of the setting of the MSR. 


“Instruction-caused” interrupts are further divided 
into two classes, precise and imprecise. 


5.3.1 Precise Interrupt 


Except for the Imprecise Mode Floating-Point Enabled 
Exception type Program interrupt, all instruction- 
caused interrupts are precise. When the fetching or 
execution of an instruction causes a precise interrupt, 
the following conditions exist at the interrupt point. 


1. SRRO addresses either the instruction causing the 
exception or the immediately following instruc- 
tion. Which instruction is addressed can be 
determined from the interrupt type and status 
bits. 


2. An interrupt is generated such that all 
instructions preceding the instruction causing the 
exception appear to have completed with respect 
to the executing processor. However, some 
storage accesses associated with these preceding 
instructions may not have been performed with 
respect to other processors and mechanisms. 


3. The instruction causing the exception may appear 
not to have begun execution (except for causing 
the exception), may have been partially executed, 
or may have completed, depending on the inter- 
rupt type. 


4. Architecturally, no subsequent instruction has 
begun execution. 


5.3.2 Imprecise Interrupt 


This architecture defines one imprecise interrupt, the 
Imprecise Mode Floating-Point Enabled Exception type 
Program interrupt. 


When the execution of an instruction causes an impre- 
cise interrupt, the following conditions exist at the 
interrupt point. 


1. SRRO addresses either the instruction causing the 
exception or some instruction following the 
instruction causing the exception that generated 
the interrupt. 


2. An interrupt is generated such that all 
instructions preceding the instruction addressed 
by SRRO appear to have completed with respect 
to the executing processor. 


3. If the imprecise interrupt is forced by the context 
synchronizing mechanism, due to an instruction 
that causes another interrupt (e.g., Alignment, 
Data Storage), then SRRO addresses the 
interrupt-forcing instruction, and the interrupt- 
forcing instruction may have been partially exe- 
cuted (see Section 5.6, “Partially Executed 
Instructions” on page 63). 


4. If the imprecise interrupt is forced by the exe- 
cution synchronizing mechanism, due to exe- 
cuting an execution synchronizing instruction 
other than syne or isync, then SRRO addresses 
the interrupt-forcing instruction, and the interrupt- 
forcing instruction appears not to have begun 
execution (except for forcing the imprecise inter- 
rupt). If the imprecise interrupt is forced by a 
sync or isync instruction, then SRRO may address 
either the syne or isync instruction, or the fol- 
lowing instruction. 


5. If the imprecise interrupt is not forced by either 
the context synchronizing mechanism or the exe- 
cution synchronizing mechanism, then the instruc- 
tion addressed by SRRO appears not to have 
begun execution, if it is not the excepting instruc- 
tion. 


6. No instruction following the instruction addressed 
by SRRO appears to have begun execution. 


All Floating-Point Enabled Exception type Program 
interrupts are maskable using the MSR bits FEO and 
FE1. Although these interrupts are maskable, they 
differ significantly from the other maskable interrupts 
in that the masking of these interrupts is usually con- 
trolled by the application program, whereas the 
masking of External and Decrementer interrupts is 
controlled by the operating system. 
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Architecture Note 


An implementation may define one or more addi- 
tional interrupts to be imprecise. If this is done, 
then a complete description of how such impre- 
cise interrupts are implemented by the processor 
and how they are to be handled by the operating 
system can be found in the Book IV, PowerPC AS 
Implementation Features document for the imple- 
mentation. Such an implementation must provide 
a means of forcing the processor to process inter- 
rupts in a precise fashion as described here, 
perhaps with reduced performance. 


The discussion here assumes that only the Impre- 
cise Mode Floating-Point Enabled Exception type 
Program interrupt is imprecise. 


Programming Note 


In general, when an interrupt occurs, the following 
instructions should be executed by the operating 
system before dispatching a “new” program. 


m stwcx. or stdcx., to clear the reservation if 
one is outstanding, to ensure that a /warx or 
Idarx in the interrupted program is not paired 
with a stwcx. or stdcx. in the “new” program. 


m sync, to ensure that all storage accesses 
caused by the interrupted program will be 
performed with respect to another processor 
before the program is resumed on that other 
processor. 


m isyncor rfid, to ensure that the instructions in 
the “new” program execute in the “new” 
context. 


5.4 Interrupt Processing 


Associated with each kind of interrupt is an interrupt 
vector, which contains the initial sequence of 
instructions that is executed when the corresponding 
interrupt occurs. 


Interrupt processing consists of saving a small part of 
the processor's state in certain registers, identifying 
the cause of the interrupt in other registers, and con- 
tinuing execution at the corresponding interrupt 
vector location. When an exception exists that will 
cause an interrupt to be generated and it has been 
determined that the interrupt will occur, the following 
actions are performed. The handling of Machine 
Check interrupts (see Section 5.5.2) differs from the 
description given below in several respects. 


1. SRRO is loaded with an instruction address that 
depends on the type of interrupt; see the specific 
interrupt description for details. 


2. Bits 33:36 and 42:47 of SRR1 are loaded with 
information specific to the interrupt type. 


3. Bits 0:32, 37:41, and 48:63 of SRR1 are loaded 
with a copy of the corresponding bits of the MSR. 


4. The MSR is set as shown in Figure 27 on 
page 54. In particular, MSR bits IR and DR are 
set to 0, disabling relocation, and MSR bit SF is 
set to 1, selecting 64-bit mode. The new values 
take effect beginning with the first instruction 
executed following the interrupt. 


5. Instruction fetch and execution resumes, using 
the new MSR value, at the effective address spe- 
cific to the interrupt type. These effective 
addresses are shown in Figure 28 on page 54. 


Interrupts do not clear reservations obtained with 
Iwarx or Idarx. 


Programming Note 


In order to handle Machine Check and System 
Reset interrupts correctly, the operating system 
should manage MSRR, as follows. 


m In the Machine Check and System Reset 
interrupt handlers, interpret SRR1 bit 62 
(where MSRR, is placed) as: 


— O: interrupt is not recoverable 
— 4: interrupt is recoverable 


m In each interrupt handler, when enough state 
has been saved that a Machine Check or 
System Reset interrupt can be recovered 
from, set MSRR, to 1. 


m In each interrupt handler, do the following (in 
order) just before returning. 


1. Set MSRa, to 0. 

2. Set SRRO and SRR1 to the values to be 
used by rfid. The new value of SRRI 
should have bit 62 set to 1 (which will 
happen naturally if SRR1 is restored to 
the value saved there by the interrupt, 
because the interrupt handler will not be 
executing this sequence unless the inter- 
rupt is recoverable). 

3. Execute rfid. 


MSRp, can be managed similarly to handle inter- 
rupts other than Machine Check and System 
Reset that occur within interrupt handlers. 


This Note describes only the management of 
MSR Ry). It is not intended to be a full description 
of the requirements for an interrupt handler. 


Engineering Note 


Implementations that use emulation assists must 
report in SRRO the effective address of the 
instruction being emulated, and in the DAR if 
applicable the effective address that would have 
been computed by the instruction being emulated. 
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5.5 Interrupt Definitions 


Figure 27 shows all the types of interrupts and the 
values assigned to the MSR for each. Figure 28 
shows the effective address of the interrupt vector for 
each interrupt type. (Section 4.2.7 on page 27 sum- 
marizes all architecturally defined uses of effective 
addresses, including those implied by Figure 28.) 


MSR Bit 
IR DR FEOFE1 EE RI ME HV 


Interrupt Type 


O 
O 
O 
O 


System Reset 
Machine Check 
Data Storage 

Data Segment 
Instruction Storage 
Instruction Segment 
External 

Alignment 

Program 

FP Unavailable 
Decrementer 
System Call 

Trace 

Performance Monitor 


S000000000000 
S0O000000000000 
S0O000000000000 
S000000000000 
S000000000000 
S000000000000 
33°333333333-+-+ 


bit is set to 0 

bit is set to 1 

bit is not altered 

if LPES=0, set to 1; otherwise (implementa- 
tion-dependent) set to 0 or unchanged 

if LEV=1 or LPES=0, set to 1; otherwise 
(implementation-dependent) set to 0 or 
unchanged 


Bits BE, FP, PMM, PR, and SE are set to 0. 


If the optional Little-Endian facility is implemented 
(see the section entitled “Little-Endian” in Book |), 
the bits associated with the facility are set as 
follows. The ILE bit is not altered. The LE bit is 
copied from the ILE bit. 


Bit SF is set to 1. 


Reserved bits are set as if written as 0. 
Figure 27. MSR setting due to interrupt 


Programming Note 


For all the cases in which it is implementa- 
tion-dependent whether the interrupt sets MSRuyy 
to 0 or leaves it unchanged, the bit should already 
be 0, so the fact that some implementations do 
not set it to 0 does not matter. 
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Effective Interrupt Type 
Address! 


..0000_0100 | System Reset 
..0000 0200 | Machine Check 
0000 0300 | Data Storage 

0000 0380 | Data Segment 
0000 0400 | Instruction Storage 
..0000 0480 | Instruction Segment 
-. 0000 0500 | External 

. 0000 0600 | Alignment 

0000 0700 | Program 

0000 0800 | Floating-Point Unavailable 
..0000_0900 | Decrementer 
..0000_0A00 | Reserved 

-. 0000 0BOO | Reserved 
..0000_0C00 | System Call 
..0000_0D00 | Trace 

.„0000_0E00 | Reserved 
..0000_0E10 | Reserved 


Reserved 
Performance Monitor 
Reserved 


..0000_OEFF 
..0000_0F00 
..0000_0F10 
..0000_OFFF | Reserved 
1 The values in the Effective Address column are 
interpreted as follows. 
m 00..0000 nnnn means 0x0000 0000 0000 nnnn 
2 Effective addresses 0x0000 0000 0000 0000 
through 0x0000 0000 0000 OOFF are used by 
software and will not be assigned as interrupt 
vectors. 


Figure 28. Effective address of interrupt vector by 
interrupt type 


Programming Note 


When address translation is disabled, use of any 
of the effective addresses that are shown as 
reserved in Figure 28 risks incompatibility with 
future implementations. 
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5.5.1 System Reset Interrupt 


If a System Reset exception causes an interrupt that 
is not context synchronizing, or causes the loss of a 
Machine Check exception, an External exception, or a 
Floating-Point Enabled Exception type Program excep- 
tion, the interrupt is not recoverable. 


The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that the processor would have 
attempted to execute next if no interrupt 
conditions were present. 


SRR1 
33:36 Set to 0. 
42:47  Setto0. 


62 Loaded from bit 62 of the MSR if the 
processor is in a recoverable state; other- 
wise set to 0. 


Others Loaded from the MSR. 
MSR See Figure 27 on page 54. 


Execution resumes at effective address 


0x0000_0000_0000_0100. 


Each implementation provides a means for software 
to distinguish power-on Reset from other types of 
System Reset, and describes it in the Book IV, 
PowerPC AS Implementation Features document for 
the implementation. 


Engineering Note 


Every attempt should be made to allow continuing 
execution. 


If the result of a System Reset interrupt is the 
same as that produced by an External interrupt 
with the exception of where execution resumes, 
the interrupt is recoverable. This condition exists 
if none of the specified exceptions have been lost 
and if the state of the processor has not been cor- 
rupted by an error in the processor. 


5.5.2 Machine Check Interrupt 


The causes of Machine Check interrupts are imple- 
mentation-dependent. For example, a Machine Check 
interrupt may be caused by a reference to a storage 
location that contains an uncorrectable error or does 
not exist (see Section 4.2.8, “Invalid Real Address” on 
page 27), or by an error in the storage subsystem. 


Machine Check interrupts are enabled when 
MSRye=1- If MSRyg=0 and a Machine Check 
occurs, the processor enters the Checkstop state. 
The Checkstop state may also be entered if an access 
is attempted to a storage location that does not exist 
(see Section 4.2.8). 


Disabled Machine Check (Checkstop State) 


When a processor is in Checkstop state, instruction 
processing is suspended and generally cannot be 
restarted without resetting the processor. Some 
implementations may preserve some or all of the 
internal state of the processor when entering 
Checkstop state, so that the state can be analyzed as 
an aid in problem determination. 


Enabled Machine Check 


If a Machine Check exception causes an interrupt that 
is not context synchronizing, or causes the loss of an 
External exception or a Floating-Point Enabled Excep- 
tion type Program exception, the interrupt is not 
recoverable. 


In some systems, the operating system may attempt 
to identify and log the cause of the Machine Check. 


The following registers are set: 


SRRO Set on a “best effort” basis to the effective 
address of some instruction that was exe- 
cuting or was about to be executed when 
the Machine Check exception occurred. 
For further details see the Book IV, 
PowerPC AS Implementation Features doc- 
ument for the implementation. 


SRR1 
62 Loaded from bit 62 of the MSR if the 
processor is in a recoverable state; other- 
wise set to 0. 


Others See the Book IV, PowerPC AS Implementa- 
tion Features document for the implemen- 


tation. 
MSR See Figure 27 on page 54. 
Execution resumes at effective address 


0x0000_0000_0000_0200. 


Programming Note 


If a Machine Check interrupt is caused by an error 
in the storage subsystem, the storage subsystem 
may return incorrect data, which may be placed 
into registers. This corruption of register contents 
may occur even if the interrupt is recoverable. 
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Engineering Note 


Every attempt should be made to allow continuing 
execution. 


If the result of a Machine Check interrupt is the 
same as that produced by an External interrupt 
with the exception of where execution resumes, 
the interrupt is recoverable. This condition exists 
if none of the specified exceptions have been lost 
and if the state of the processor has not been cor- 
rupted by an error in the processor. A load oper- 
ation that places data, possibly corrupted by the 
storage subsystem, into a GPR or FPR does not 
make the interrupt unrecoverable. 


5.5.3 Data Storage Interrupt 


A Data Storage interrupt occurs when no higher pri- 
ority exception exists and a data access cannot be 
performed for any of the following reasons. 


m Data address translation is enabled (MSRpp=1) 
and the virtual address of any byte of the storage 
location specified by a Load, Store, icbi, dcbz, 
dcbst, dcbf, eciwx, or ecowx instruction cannot be 
translated to a real address. 

m The effective address specified by a Iwarx, Idarx, 
stwex., or stdcx. instruction refers to storage that 
is Write Through Required or Caching Inhibited. 

m The access violates storage protection. 

= A Data Address Compare match or a Data 
Address Breakpoint Register (DABR) match 
occurs. 

m Execution of an eciwx or ecowx instruction is dis- 
allowed because EAR-=0. 


If a stwex. or stdex. would not perform its store in the 
absence of a Data Storage interrupt, and either (a) 
the specified effective address refers to storage that 
is Write Through Required or Caching Inhibited, or (b) 
a non-conditional Store to the specified effective 
address would cause a Data Storage interrupt, it is 
implementation-dependent whether a Data Storage 
interrupt occurs. 


If a Move Assist instruction has a length of zero (in 
the XER), a Data Storage interrupt does not occur for 
reasons of address translation or storage protection, 
regardless of the effective address. 


The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that caused the interrupt. 


SRR1 
33:36 Set to 0. 
42:47 ~— Set to 0. 
Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 
DSISR 
0 Set to 0. 
1 Set to 1 if MSRpp=1 and the translation 


for an attempted access is not found in the 
primary PTEG or in the secondary PTEG; 
otherwise set to 0. 

2:3 Set to 0. 

4 Set to 1 if the access is not permitted by 
the storage protection mechanism; other- 
wise set to 0. 


r— Programming Note 


The only cases in which DSISR, can be 
set to 1 for an access that occurs when 
MSRpa=0 are those described in 
Figure 23. These cases can be distin- 
guished from other causes of data 
storage protection violations by exam- 
ining SRRizg (the bit in which MSRpa 
was saved by the interrupt). 


5 Set to 1 if the access is due to a Iwarx, 
Idarx, stwex., or stdcx. instruction that 
addresses storage that is Write Through 
Required or Caching Inhibited; otherwise 
set to 0. 

6 Set to 1 for a Store, dcbz, or ecowx 
instruction; otherwise set to 0. 

7:8 Set to 0. 


9 Set to 1 if a Data Address Compare match 
or a DABR match occurs; otherwise set to 
0. 

10 Set to 0. 

11 Set to 1 if execution of an eciwx or ecowx 


instruction is attempted when EAR=0; 
otherwise set to 0. 

12:14 Set to 0. 

15 Set to 1 if MSRpp=1, the translation for an 
attempted access is found in the SLB, the 
translation is not found in the primary 
PTEG or in the secondary PTEG, and 
SLBE =1; otherwise set to 0. 

16:31 Set to 0. 


DAR Set to the effective address of a storage 
element as described in the following list. 
The list should be read from the top down; 
the DAR is set as described by the first 
item that corresponds to an exception that 
is reported in the DSISR. For example, if a 
Load instruction causes a storage pro- 
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tection violation and a DABR match (and 
both are reported in the DSISR), the DAR 
is set to the effective address of a byte in 
the first aligned doubleword for which 
access was attempted in the page that 
caused the exception. 
m a Data Storage exception occurs for 
reasons other than DABR match or, for 
eciwx and ecowx, EARz= 0 
— a byte in the block that caused the 
exception, for a Cache Manage- 
ment instruction 

— a byte in the first aligned 
doubleword for which access was 
attempted in the page that caused 
the exception, for a Load, Store, 
eciwx, or ecowx instruction (“first” 
refers to address order; see 
Section 5.7) 

m undefined, for a DABR match, or if 
eciwx or ecowx is executed when 
EARg O 

If the interrupt occurs in 32-bit mode, the 
high-order 32 bits of the DAR are set to 0. 


If multiple Data Storage exceptions occur for a given 
effective address, any one or more of the bits corre- 
sponding to these exceptions may be set to 1 in the 
DSISR. 


Programming Note 


More than one bit may be set to 1 in the DSISR in 
the following combinations. 


1, {s+} 
1, 15, {s+} 
4, {s+} 

4, 5, {s} 
5, {s} 
{s+} 


In this list, “(s)” represents any combination of 
the set of bits {6, 9} and “{s+}” adds bit 11 to 
this set. 


address 


effective 


Execution resumes at 
0x0000_0000_0000_0300. 


Engineering Note 


For initial hardware debug it is often useful to run 
with cache disabled. In some ways cache disa- 
bled mode is similar to Caching Inhibited storage. 


5.5.4 Data Segment Interrupt 


A Data Segment interrupt occurs when no higher pri- 
ority exception exists and a data access cannot be 
performed because data address translation is 
enabled (MSRpp=1) and the effective address of any 
byte of the storage location specified by a Load, 
Store, icbi, dcbz, dcbst, dcbf, eciwx, or ecowx instruc- 
tion cannot be translated to a virtual address. 


If a stwex. or stdex. would not perform its store in the 
absence of a Data Segment interrupt, and a non- 
conditional Store to the specified effective address 
would cause a Data Segment interrupt, it is imple- 
mentation-dependent whether a Data Segment inter- 
rupt occurs. 


If a Move Assist instruction has a length of zero (in 
the XER), a Data Segment interrupt does not occur, 
regardless of the effective address. 


The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that caused the interrupt. 


SRR1 
33:36 Set to 0. 
42:47 — Set to 0. 
Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 


DAR Set to the effective address of a storage 
element as described in the following list. 

m a byte in the block that caused the 
Data Segment interrupt, for a Cache 
Management instruction 

m a byte in the first aligned doubleword 
for which access was attempted in the 
segment that caused the Data 
Segment interrupt, for a Load, Store, 
eciwx, or ecowx instruction (“first” 
refers to address order; see Section 
5.7) 

If the interrupt occurs in 32-bit mode, the 

high-order 32 bits of the DAR are set to 0. 


address 


effective 


Execution resumes at 
0x0000 0000 0000 0380. 


Programming Note 


A Data Segment interrupt occurs if MSRpp= 1 and 
the translation of the effective address of any 
byte of the specified storage location is not found 
in the SLB. 
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5.5.5 Instruction Storage Interrupt 


An Instruction Storage interrupt occurs when no 
higher priority exception exists and the next instruc- 
tion to be executed cannot be fetched for any of the 
following reasons. 


m Instruction address translation is enabled 
(MSR;p=1) and the virtual address cannot be 
translated to a real address. 

m The fetch access violates storage protection. 


The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that the processor would have 
attempted to execute next if no interrupt 
conditions were present (if the interrupt 
occurs on attempting to fetch a branch 
target, SRRO is set to the branch target 


address). 
SRR1 

33 Set to 1 if MSRip=1 and the translation for 
an attempted access is not found in the 
primary PTEG or in the secondary PTEG; 
otherwise set to 0. 

34 Set to 0. 

35 Set to 1 if the access occurs when 
MSR;p=1 and is to No-execute storage or 
to Guarded storage; otherwise set to 0. 

36 Set to 1 if the access is not permitted by 
Figure 22 or 23, as appropriate; otherwise 
set to 0. 


r— Programming Note 


The only cases in which SRR13, can be 
set to 1 for an access that occurs when 
MSR;p=0 are those described in 
Figure 23. These cases can be distin- 
guished from other causes of instruc- 
tion storage protection violations that 
set SRR1zg to 1 by examining SRR15g 
(the bit in which MSRya was saved by 
the interrupt). 


42:46 Set to 0. 

47 Set to 1 if MSRyp=1, the translation for an 
attempted access is found in the SLB, the 
translation is not found in the primary 
PTEG or in the secondary PTEG, and 
SLBE,=1; otherwise set to 0. 

Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 


If multiple Instruction Storage exceptions occur due to 
attempting to fetch a single instruction, any one or 
more of the bits corresponding to these exceptions 
may be set to 1 in SRRI. 


Programming Note 


More than one bit may be set to 1 in SRR1 in the 
following combinations. 


33, 35 
33, 47 
33, 35, 47 
35, 36 


Execution resumes at effective address 


0x0000_0000_0000_0400. 


5.5.6 Instruction Segment Interrupt 


An Instruction Segment interrupt occurs when no 
higher priority exception exists and the next instruc- 
tion to be executed cannot be fetched because 
instruction address translation is enabled (MSRjp=1) 
and the effective address cannot be translated to a 
virtual address. 


The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that the processor would have 
attempted to execute next if no interrupt 
conditions were present (if the interrupt 
occurs on attempting to fetch a branch 
target, SRRO is set to the branch target 
address). 


SRR1 
33:36 Set to 0. 
42:47 Set to 0. 
Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 


Execution resumes at effective address 


0x0000_0000_0000_0480. 


Programming Note 


interrupt occurs if 
and the translation of the effective 
address of the next instruction to be executed is 
not found in the SLB. 


An Instruction 
MSRjp= 1 


Segment 
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5.5.7 External Interrupt 


An External interrupt occurs when no higher priority 
exception exists, an External interrupt exception is 
presented to the interrupt mechanism, and MSRee=1. 
The occurrence of the interrupt does not cancel the 
request. 


The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that the processor would have 
attempted to execute next if no interrupt 
conditions were present. 


SRR1 
33:36 Set to 0. 
42:47 ~— Set to 0. 
Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 


Execution resumes at effective address 


0x0000_0000_0000_0500. 


5.5.8 Alignment Interrupt 


An Alignment interrupt occurs when no higher priority 
exception exists and a data access cannot be per- 
formed for any of the following reasons. 


m The operand of a floating-point Load or Store is 
not word-aligned, or crosses a virtual page 
boundary. 


m The operand of Imw, stmw, lwarx, Idarx, stwex., 
stdcx., eciwx, or ecowx is not aligned. 


m The operand of a single-register Load or Store is 
not aligned and the processor is in Little-Endian 
mode. 


m The instruction is Imw, stmw, Iswi, Iswx, stswi, or 
stswx, and the operand is in storage that is Write 
Through Required or Caching Inhibited, or the 
processor is in Little-Endian mode. 


m The operand of a Load or Store crosses a 
segment boundary, or crosses a boundary 
between virtual pages that have different storage 
control attributes. 


m The operand of a Load or Store is not aligned and 
is in storage that is Write Through Required or 
Caching Inhibited. 


m The operand of dcbz, Iwarx, Idarx, stwex., or 
stdcx. is in storage that is Write Through 
Required or Caching Inhibited. 


Engineering Note 


Early implementations have additional require- 
ments for Imw and stmw, for reasons of compat- 
ibility with the POWER Architecture. See the 
section entitled “Load/Store Multiple Instructions” 
in the “Incompatibilities with the POWER Architec- 
ture” appendix of Book I. 


If a stwex. or stdex. would not perform its store in the 
absence of an Alignment interrupt and the specified 
effective address refers to storage that is Write 
Through Required or Caching Inhibited, it is imple- 
mentation-dependent whether an Alignment interrupt 
occurs. 


Setting the DSISR and DAR as described below is 
optional for implementations on which Alignment 
interrupts occur rarely, if ever, for cases that the 
Alignment interrupt handler emulates. For such 
implementations, if the DSISR and DAR are not set as 
described below they are set to undefined values. 


Engineering Note 


For a given implementation, decisions regarding 
whether to set the DSISR and DAR as described 
in the remainder of this section, and what poten- 
tial causes of Alignment interrupts actually cause 
Alignment interrupts, must include consideration 
of the cases that the Alignment interrupt handler 
would emulate, and of the effect of such emulation 
on software performance. 


The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that caused the interrupt. 


SRR1 
33:36 Set to 0. 
42:47 Set to 0. 
Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 


DSISR 
0:11 Set to 0. 
12:13 Set to bits 30:31 of the instruction if 


DS-form. 
Set to 0b00 if D- or X-form. 

14 Set to 0. 

15:16 Set to bits 29:30 of the instruction if X-form. 
Set to 0b00 if D- or DS-form. 


17 Set to bit 25 of the instruction if X-form. 
Set to bit 5 of the instruction if D- or 
DS-form. 


18:21 Set to bits 21:24 of the instruction if X-form. 
Set to bits 1:4 of the instruction if D- or 
DS-form. 
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22:26 Set to bits 6:10 of the instruction 
(RT/RS/FRT/FRS), except undefined for 
dcbz. 

27:31 + Set to bits 11:15 of the instruction (RA) for 
update form instructions; set to either bits 
11:15 of the instruction or to any register 
number not in the range of registers to be 
loaded for a valid form Imw, a valid form 
Iswi, or a valid form Iswx for which neither 
RA nor RB is in the range of registers to be 
loaded; otherwise undefined. 


m Engineering Note 


The requirement for Imw, Iswi, and Iswx 
ensures that the program that emulates 
these instructions when they cause an 
Alignment interrupt on the 601 can also 
be used on subsequent PowerPC AS 
implementations. (601 implements 
POWER semantics for these 
instructions, preserving RA when it is in 
the range to be loaded and is not 0. 
Therefore the 601 Alignment interrupt 
handler must do the same. Software 
wants to use the same Alignment inter- 
rupt handler for all PowerPC AS imple- 
mentations. This requires that the “RA 
field” saved in the DSISR for post-601 
implementations not be in the range 
that would be loaded if the effective 
address were aligned.) 


For Imw, the requirement can be met 
either by storing zeros or by storing the 
RT field with 1 subtracted from it. For 
Iswi and Iswx, it can be met by storing 
the RT field with 1 subtracted from it 
(the Load String instructions wrap from 
GPR 31 to GPR 0, so simply storing 
zeros is not adequate). 


DAR Set to the effective address computed by 
the instruction, except that if the interrupt 
occurs in 32-bit mode the high-order 32 bits 
of the DAR are set to 0. 


For an X-form Load or Store, it is acceptable for the 
processor to set the DSISR to the same value that 
would have resulted if the corresponding D- or 
DS-form instruction had caused the interrupt. Simi- 
larly, for a D- or DS-form Load or Store, it is accept- 
able for the processor to set the DSISR to the value 
that would have resulted for the corresponding X-form 
instruction. For example, an unaligned Iwax (that 
crosses a protection boundary) would normally, fol- 
lowing the description above, cause the DSISR to be 
set to binary: 


where “ttttt” denotes the RT field, and “?????” 
denotes an undefined 5-bit value. However, it is 


acceptable if it causes the DSISR to be set as for Iwa, 
which is 


If there is no corresponding alternative form instruc- 
tion (e.g., for Iwaux), the value described above is set 
in the DSISR. 


The instruction pairs that may use the same DSISR 
value are: 


lhz/lhzx lhzu/lhzux lha/lhax lhau/lhaux 
lwz/lwzx lwzu/lwzux lwa/lwax 

ld/1dx ldu/1dux 

sth/sthx sthu/sthux stw/stwx stwu/stwux 
std/stdx stdu/stdux 

lfs/lfsx lfsu/lfsux lfd/lfdx lfdu/lfdux 


stfs/stfsx stfsu/stfsux stfd/stfdx stfdu/stfdux 


Execution resumes at effective address 


0x0000_0000_0000_0600. 


Programming Note 


The architecture does not support the use of an 
unaligned effective address by Iwarx, Idarx, 
stwcx., stdcx., eciwx, and ecowx. If an Alignment 
interrupt occurs because one of these instructions 
specifies an unaligned effective address, the 
Alignment interrupt handler must not attempt to 
simulate the instruction, but instead should treat 
the instruction as a programming error. 


5.5.9 Program Interrupt 


A Program interrupt occurs when no higher priority 
exception exists and one of the following exceptions 
arises during execution of an instruction: 


Floating-Point Enabled Exception 


A Floating-Point Enabled Exception type Program 
interrupt is generated when the expression 


(MSRpeo | MSRegy) & FPSCRrex 


is 1. FPSCRrex is set to 1 by the execution of a 
floating-point instruction that causes an enabled 
exception, including the case of a Move To 
FPSCR instruction that causes an exception bit 
and the corresponding enable bit both to be 1. 


Illegal Instruction 


An Illegal Instruction type Program interrupt is 
generated when execution is attempted of an 
illegal instruction, or of a reserved or optional 
instruction that is not provided by the implemen- 
tation. 


An Illegal Instruction type Program interrupt may 
be generated when execution is attempted of any 
of the following kinds of instruction. 


m an instruction that is in invalid form 
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m an Iswx instruction for which RA or RB is in 
the range of registers to be loaded 


m an mtspr or mfspr instruction with an SPR 
field that does not contain one of the defined 
values, or an mftb instruction with a TBR 
field that does not contain one of the defined 
values 


Engineering Note 


Early implementations have additional 
requirements for instructions that are 
reserved because they correspond to non- 
privileged POWER instructions that are not in 
PowerPC AS, and for mtspr and mfspr, for 
reasons of compatibility with the POWER 
Architecture. See the sections entitled “Move 
To/From SPR” and “Discontinued Opcodes” in 
the “Incompatibilities with the POWER Archi- 
tecture” appendix of Book I. 


Privileged Instruction 


The following applies if the instruction is executed 
when MSRpp = 1. 


A Privileged Instruction type Program inter- 
rupt is generated when execution is 
attempted of a privileged instruction, or of an 
mtspr or mfspr instruction with an SPR field 
that contains one of the defined values 
having sprg=1. It may be generated when 


execution is attempted of an mtspr or mfspr 


instruction with an SPR field that does not 
contain one of the defined values but has 
Spro=1, or when execution is attempted of an 
mftb instruction with a TBR field that does 
not contain one of the defined values but has 
toro=1. 


The following applies if the instruction is executed 
when MSRuyy PR= 0b00. 


A Privileged Instruction type Program inter- 
rupt may be generated when execution is 
attempted of an mtspr instruction with an 
SPR field that designates a hypervisor 
resource (see Section 1.7, “Logical Parti- 
tioning (LPAR)” on page 4), or when exe- 
cution of a tlbie or tlbsync instruction is 
attempted. 


Programming Note 


These are the only cases in which a Privi- 
leged Instruction type Program interrupt 
can be generated when MSRpp=0. They 
can be distinguished from other causes of 
Privileged Instruction type Program inter- 
rupts by examining SRRi4 (the bit in 
which MSRpp was saved by the inter- 
rupt). 


Trap 


A Trap type Program interrupt is generated when 
any of the conditions specified in a Trap instruc- 
tion is met. 


The following registers are set: 


SRRO 


SRR1 
33:36 
42 
43 
44 
45 
46 


47 


For all Program interrupts except a 
Floating-Point Enabled Exception when in 
one of the Imprecise modes, set to the 
effective address of the instruction that 
caused the interrupt. 


For an Imprecise Mode Floating-Point 
Enabled Exception, set to the effective 
address of the excepting instruction or to 
the effective address of some subsequent 
instruction. If SRRO points to a subsequent 
instruction, that instruction has not been 
executed. If a subsequent instruction is 
isync or sync, SRRO will not point more 
than four bytes beyond the isyne or sync 
instruction. 


If FPSCRrex= 1 but Floating-Point Enabled 
Exception type Program interrupts are dis- 
abled by having both MSRreg and MSRee+ 
= 0, a Floating-Point Enabled Exception 
type Program interrupt will occur prior to 
or at the next synchronizing event if these 
MSR bits are altered by any instruction 
that can set the MSR so that the 
expression 


(MSRreo | MSRpg1) & PPSCRrex 


is 1. When this occurs, SRRO is loaded 
with the address of the instruction that 
would have executed next, not with the 
address of the instruction that modified the 
MSR causing the interrupt. 


Set to 0. 

Set to 0. 

Set to 1 for a Floating-Point Enabled Excep- 
tion type Program interrupt; otherwise set 
to 0. 

Set to 1 for an Illegal Instruction type 
Program interrupt; otherwise set to 0. 

Set to 1 for a Privileged Instruction type 
Program interrupt; otherwise set to 0. 

Set to 1 for a Trap type Program interrupt; 
otherwise set to 0. 

Set to 0 if SRRO contains the address of 
the instruction causing the exception, and 
to 1 if SRRO contains the address of a sub- 
sequent instruction. 


Others Loaded from the MSR. 
Only one of bits 43:46 can be set to 1. 


MSR 


Execution 


See Figure 27 on page 54. 


resumes at effective address 


0x0000_0000_0000_0700. 
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Engineering Note 


If the Imprecise Recoverable Mode Floating-Point 
Enabled Exception type Program interrupt is 
implemented as imprecise, the hardware must 
provide, at the minimum, the address at which to 
resume the interrupted process (this is given in 
SRRO), the excepting instruction's opcode, 
extended opcode, and record bit, the source 
values or registers, and the target register. This 
information can be provided directly in registers 
or by means of a pointer to the excepting instruc- 
tion. The manner in which it is provided is 
described in the Book IV, PowerPC AS Implemen- 
tation Features document for the implementation. 


5.5.10 Floating-Point Unavailable 
Interrupt 


A Floating-Point Unavailable interrupt occurs when no 
higher priority exception exists, an attempt is made to 
execute a floating-point instruction (including floating- 
point loads, stores, and moves), and MSRrp=0. 


The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that caused the interrupt. 


SRR1 
33:36 Set to 0. 
42:47 ~— Set to 0. 
Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 


Execution resumes at effective address 


0x0000_0000_0000_0800. 


5.5.11 Decrementer Interrupt 


A Decrementer interrupt occurs when no higher pri- 
ority exception exists, a Decrementer exception 
exists, and MSRgg=1. The occurrence of the inter- 
rupt cancels the request. 


The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that the processor would have 
attempted to execute next if no interrupt 
conditions were present. 


SRR1 
33:36 Set to 0. 
42:47 Set to 0. 
Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 


Execution resumes at effective address 


0x0000_0000_0000_0900. 


5.5.12 System Call Interrupt 


A System Call interrupt occurs when a System Call 

instruction is executed. 

The following registers are set: 

SRRO Set to the effective address of the instruc- 
tion following the System Call instruction. 


SRR1 
33:36 Set to 0. 
42:47 ~— Set to 0. 
Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 


address 


effective 


Execution resumes at 
0x0000_0000_0000_0C00. 


Programming Note 


An attempt to execute an sc instruction with 
LEV=1 in problem state should be treated as a 
programming error. 
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5.5.13 Trace Interrupt 


A Trace interrupt occurs when no higher priority 
exception exists and either MSRge=1 and any 
instruction except rfid is successfully completed, or 
MSRege=1 and a Branch instruction is completed. 
Successful completion means that the instruction 
caused no other interrupt. Thus a Trace interrupt 
never occurs for a System Call instruction, or for a 
Trap instruction that traps. The instruction that 
causes a Trace interrupt is called the “traced instruc- 
tion”. 


When a Trace interrupt occurs, the following registers 
are set: 


SRRO Set to the effective address of the instruc- 
tion that the processor would have 
attempted to execute next if no interrupt 
conditions were present. 


SRR1 
33:36 and 42:47 See the Book IV, PowerPC AS 
Implementation Features document for the 
implementation. 
Others Loaded from the MSR. 


MSR See Figure 27 on page 54. 


Execution resumes at effective address 


0x0000 0000 0000 ODOVO. 


Extensions to the Trace facility are described in 
Appendix F, “Example Trace Extensions (Optional)” 
on page 103. 


Programming Note 
The following instructions are not traced. 


m rfid 

m sc, and Trap instructions that trap 

m other instructions that cause interrupts (other 
than Trace interrupts) 

m the first instructions of any interrupt handler 

m instructions that are emulated by software 


In general, interrupt handlers can achieve the 
effect of tracing these instructions. 


Architecture Note 


If a Trace interrupt were permitted after an rfid, 
the Trace interrupt handler would never be able 
to return to a program for which MSRgp=1. 


5.5.14 Performance Monitor Interrupt 
(Optional) 


The Performance Monitor interrupt is part of the 
optional Performance Monitor facility; see 
Appendix E. If the Performance Monitor facility is not 
implemented or does not use this interrupt, the corre- 
sponding interrupt vector (see Figure 28 on page 54) 
is treated as reserved. 


5.6 Partially Executed 
Instructions 


If a system-caused, Data Storage, Data Segment, or 
Alignment exception occurs while a Load or Store 
instruction is executing, the instruction may be 
aborted. In such cases the instruction is not com- 
pleted, but may have been partially executed in the 
following respects. 


= Some of the bytes of the storage operand may 
have been accessed, except that if access to a 
given byte of the storage operand would violate 
storage protection, that byte is neither copied to 
a register by a Load instruction nor modified by a 
Store instruction. Also, the rules for storage 
accesses given in Section 4.2.4.1, “Guarded 
Storage” on page 24 and in the section entitled 
“Instruction Restart” in Book Il are obeyed. 


m Some registers may have been altered as 
described in the Book Il section cited above. 


m Reference and Change bits may have been 
updated as described in Section 4.8. 


m For a stwcx. or stdex. instruction that is executed 
in-order, CRO may have been set to an undefined 
value and the reservation may have been 
cleared. 


The architecture does not support continuation of an 
aborted instruction but intends that the aborted 
instruction be re-executed if appropriate. 
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Programming Note 


An exception may result in the partial execution of 
a Load or Store instruction. For example, if the 
Page Table Entry that translates the address of 
the storage operand is altered, by a program 
running on another processor, such that the new 
contents of the Page Table Entry preclude per- 
forming the access, the alteration could cause the 
Load or Store instruction to be aborted after 
having been partially executed. 


As stated in the Book Il section cited above, if an 
instruction is partially executed the contents of 
registers are preserved to the extent that the 
instruction can be re-executed correctly. The con- 
sequent preservation is described in the following 
list. For any given instruction, zero or one item in 
the list applies. 


m For a fixed-point Load instruction that is not a 
multiple or string form, or for an eciwx 
instruction, if RT = RA or RT = RB then the 
contents of register RT are not altered. 


m For an update form Load or Store instruction, 
the contents of register RA are not altered. 


5.7 Exception Ordering 


Since multiple exceptions can exist at the same time 
and the architecture does not provide for reporting 
more than one interrupt at a time, the generation of 
more than one interrupt is prohibited. Also some 
exceptions would be lost if they were not recognized 
and handled when they occurred. For example, if an 
External interrupt was generated when a Data 
Storage exception existed, the Data Storage exception 
would be lost. If the Data Storage exception was 
caused by a Store Multiple instruction for which the 
storage operand crosses a virtual page boundary and 
the exception was a result of attempting to access the 
second virtual page, the store could have modified 
locations in the first virtual page even though it 
appeared that the Store Multiple instruction was 
never executed. 


In addition, the architecture defines imprecise inter- 
rupts that must be recoverable, cannot be lost, and 
can occur at any time with respect to the executing 
instruction stream. Some of the maskable and non- 
maskable exceptions are persistent and can be 
deferred. The following exceptions persist even 
though some other interrupt is generated: 


m Floating-Point Enabled Exceptions 
m External 
m Decrementer 


For the above reasons, all exceptions are prioritized 
with respect to other exceptions that may exist at the 
same instant to prevent the loss of any exception that 


is not persistent. Some exceptions cannot exist at the 
same instant as some others. 


Data Storage, Data Segment, and Alignment excep- 
tions occur as if the storage operand were accessed 
one byte at a time in order of increasing effective 
address (with the obvious caveat if the operand 
includes both the maximum effective address and 
effective address 0). 


5.7.1 Unordered Exceptions 


The exceptions listed here are unordered, meaning 
that they may occur at any time regardless of the 
state of the interrupt processing mechanism. These 
exceptions are recognized and processed when pre- 
sented. 


1. System Reset 
2. Machine Check 


5.7.2 Ordered Exceptions 


The exceptions listed here are ordered with respect to 
the state of the interrupt processing mechanism. 


System-Caused or Imprecise 


1. Program 

- Imprecise Mode Floating-Point Enabled Exception 
2. External 
3. Decrementer 


Instruction-Caused and Precise 


1. Instruction Segment 
2. Instruction Storage 
3. Program 
- Illegal Instruction 
- Privileged Instruction 
4. Function-Dependent 
4.a Fixed-Point 
ta Program 
- Trap 
1b System Call 
tc Data Storage, Data Segment, or Alignment 
2 Trace 


4.b Floating-Point 
1 FP Unavailable 
2a Program 
- Precise Mode Floating-Point Enabled Excep'n 
2b Data Storage, Data Segment, or Alignment 
3 Trace 


For implementations that execute multiple instructions 
in parallel using pipeline or superscalar techniques, or 
combinations of these, it can be difficult to understand 
the ordering of exceptions. To understand this 
ordering it is useful to consider a model in which each 
instruction is fetched, then decoded, then executed, 
all before the next instruction is fetched. In this 
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model, the exceptions a single instruction would gen- 
erate are in the order shown in the list of instruction- 
caused exceptions. Exceptions with different numbers 
have different ordering. Exceptions with the same 
numbering but different lettering are mutually exclu- 
sive and cannot be caused by the same instruction. 
Where Data Storage, Data Segment, and Alignment 
exceptions are listed in the same item they have 
equal ordering. 


Even on processors that are capable of executing 
several instructions simultaneously, or out of order, 
instruction-caused interrupts (precise and imprecise) 
occur in program order. 


5.8 Interrupt Priorities 


This section describes the relationship of nonmask- 
able, maskable, precise, and imprecise interrupts. In 
the following descriptions, the interrupt mechanism 
waiting for all possible exceptions to be reported 
includes only exceptions caused by previously initi- 
ated instructions (e.g., it does not include waiting for 
the Decrementer to step through zero). The excep- 
tions are listed in order of highest to lowest priority. 


1. System Reset 


System Reset exception has the highest priority 
of all exceptions. If this exception exists, the 
interrupt mechanism ignores all other exceptions 
and generates a System Reset interrupt. 


Once the System Reset interrupt is generated, no 
nonmaskable interrupts are generated due to 
exceptions caused by instructions issued prior to 
the generation of this interrupt. 


2. Machine Check 


Machine Check exception is the second highest 
priority exception. If this exception exists and a 
System Reset exception does not exist, the inter- 
rupt mechanism ignores all other exceptions and 
generates a Machine Check interrupt. 


Once the Machine Check interrupt is generated, 
no nonmaskable interrupts are generated due to 
exceptions caused by instructions issued prior to 
the generation of this interrupt. 


3. Instruction-Dependent 


This exception is the third highest priority excep- 
tion. When this exception is created, the interrupt 
mechanism waits for all possible Imprecise 
exceptions to be reported. It then generates the 
appropriate ordered interrupt if no higher priority 
exception exists when the interrupt is to be gen- 
erated. Within this category a particular instruc- 
tion may present more than a single exception. 
When this occurs, those exceptions are ordered in 
priority as indicated in the following lists. Where 


Data Storage, Data Segment, and Alignment 
exceptions are listed in the same item they have 
equal priority (i.e., the processor may generate 
any one of the three interrupts for which an 
exception exists). 


A. Fixed-Point Loads and Stores 


a. Data Storage, Data Segment, or Align- 
ment 
b. Trace 


B. Floating-Point Loads and Stores 


a. Floating-Point Unavailable 

b. Data Storage, Data Segment, or Align- 
ment 

c. Trace 


C. Other Floating-Point Instructions 


a. Floating-Point Unavailable 

b. Program - Precise Mode Floating-Point 
Enabled Exception 

c. Trace 


D. rfid and mtmsr{ d] 


a. Program - Privileged Instruction 

b. Program - Precise Mode Floating-Point 
Enabled Exception 

c. Trace, for mtmsr[ d] only 


If the MSR bits FEO and FE1 are set such that 
Precise Mode Floating-Point Enabled Excep- 
tion type Program interrupts are enabled and 
FPSCR bit FEX is set, a Program interrupt will 
result prior to or at the next synchronizing 
event. 


E. Other Instructions 


a. These exceptions are mutually exclusive 
and have the same priority: 
m Program - Trap 
m System Call 
m Program - Privileged Instruction 
m Program - Illegal Instruction 
b. Trace 


F. Instruction Storage and Instruction Segment 


These exceptions have the lowest priority in 
this category. They are recognized only 
when all instructions prior to the instruction 
causing one of these exceptions appear to 
have completed and that instruction is the 
next instruction to be executed. The two 
exceptions are mutually exclusive. 


The priority of these exceptions is specified 
for completeness and to ensure that they are 
not given more favorable treatment. It is 
acceptable for an implementation to treat 
these exceptions as though they had a lower 
priority. 
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4. Program - Imprecise Mode Floating-Point Enabled 


Exception 


This exception is the fourth highest priority 
exception. When this exception is created, the 
interrupt mechanism waits for all other possible 
exceptions to be reported. It then generates this 
interrupt if no higher priority exception exists 
when the interrupt is to be generated. 
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5. External 


This exception is the fifth highest priority excep- 
tion. When this exception is created, the interrupt 
mechanism waits for all other possible exceptions 
to be reported. It then generates this interrupt if 
no higher priority exception exists when the inter- 
rupt is to be generated. 


. Decrementer 


This exception is the lowest priority exception. 
When this exception is created, the interrupt 
mechanism waits for all other possible exceptions 
to be reported. It then generates this interrupt if 
no higher priority exception exists when the inter- 
rupt is to be generated. 
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Chapter 6. Timer Facilities 


6.1 Overview ................ 67 
6.2 Time Base ............... 67 
6.2.1 Writing the Time Base ....... 68 


6.1 Overview 


The Time Base and the Decrementer provide timing 
functions for the system. Both are volatile resources 
and must be initialized during startup. The mftb 
instruction is used to read the Time Base; the mtspr 
and mfspr instructions are used to write the Time 
Base and Decrementer and to read the Decrementer. 


Time Base (TB) 
The Time Base provides a long-period counter 
driven by an implementation-dependent fre- 
quency. 


Decrementer (DEC) 
The Decrementer, a counter that is updated at 
the same rate as the Time Base, provides a 
means of signaling an interrupt after a specified 
amount of time has elapsed unless 


m the Decrementer is altered by software in the 
interim, or 
m the Time Base update frequency changes. 


6.2 Time Base 


The Time Base (TB) is a 64-bit register (see 
Figure 29) containing a 64-bit unsigned integer that is 
incremented periodically. Each increment adds 1 to 
the low-order bit (bit 63). The frequency at which the 
integer is updated is implementation-dependent. 


TBU TBL 
0 32 63 


Field Description 
TBU Upper 32 bits of Time Base 
TBL Lower 32 bits of Time Base 


Figure 29. Time Base 


6.3 Decrementer ............. 69 
6.3.1 Writing and Reading the 
Decrementer ............... 69 


The Time Base is a hypervisor resource; see Section 
1.7, “Logical Partitioning (LPAR)” on page 4. 


There is no automatic initialization of the Time Base; 
system software must perform this initialization. 


The Time Base increments until its value becomes 
OxFFFF_FFFF_FFFF_FFFF (24 — 1). At the next incre- 
ment, its value becomes 0x0000 0000 0000 0000. 
There is no interrupt or other indication when this 
occurs. 


The period of the Time Base depends on the driving 
frequency. As an order of magnitude example, 
suppose that the CPU clock is 100 MHz and that the 
Time Base is driven by this frequency divided by 32. 
Then the period of the Time Base would be 


o 26x32 12 
Trg = 100 MHz > 5.90 x 10 “ seconds 


which is approximately 187,000 years. 


The Time Base must be implemented such that the 
following requirements are satisfied. 


1. Loading a GPR from the Time Base shall have no 
effect on the accuracy of the Time Base. 


2. Storing a GPR to the Time Base shall replace the 
value in the Time Base with the value in the GPR. 


The PowerPC AS Architecture does not specify a 
relationship between the frequency at which the Time 
Base is updated and other frequencies, such as the 
CPU clock or bus clock in a PowerPC AS system. The 
Time Base update frequency is not required to be 
constant. What is required, so that system software 
can keep time of day and operate interval timers, is 
one of the following. 


m The system provides an (implementa- 
tion-dependent) interrupt to software whenever 
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the update frequency of the Time Base changes, 
and a means to determine what the current 
update frequency is. 


m The update frequency of the Time Base is under 
the control of the system software. 


Implementations must provide a means for either pre- 
venting the Time Base from incrementing or pre- 
venting it from being read in problem state 
(MSRpp=1). If the means is under software control, it 
must be accessible only in hypervisor state 
(MSRuy pr = 0b10). There must be a method for 
getting all processors' Time Bases to start incre- 
menting with values that are identical or almost iden- 
tical in all processors. 


Architecture Note 


Disabling the Time Base or making the mftb 
instruction privileged prevents the Time Base 
from being used to implement a “covert channel” 
in a secure system. 


The requirements stated above for the Time Base 
apply also to any other SPRs that measure time 
and can be read in problem state (e.g., Perform- 
ance Monitor registers). 


Programming Note 


If the hypervisor initializes the Time Base on 
power-on to some reasonable value and the 
update frequency of the Time Base is constant, 
the Time Base can be used as a source of values 
that increase at a constant rate, such as for time 
stamps in trace entries. 


Even if the update frequency is not constant, 
values read from the Time Base are 
monotonically increasing (except when the Time 
Base wraps from 264-1 to 0). If a trace entry is 
recorded each time the update frequency 
changes, the sequence of Time Base values can 
be post-processed to become actual time values. 


Successive readings of the Time Base may return 
identical values. 


See the description of the Time Base in Book Il, 
PowerPC AS Virtual Environment Architecture for 
ways to compute time of day in POSIX format 
from the Time Base. 


Architecture Note 


It is intended that the Time Base be useful for 
timing reasonably short sequences of code (a few 
hundred instructions) and for low-overhead time 
stamps for tracing. The Time Base should not 
“tick” faster than the CPU instruction clock. 
Driving the Time Base directly from the CPU 
instruction clock is probably finer granularity than 
necessary; the instruction clock divided by 8, 16, 
or 32 would be more appropriate. 


The Time Base driving frequency is also used to 
update the Decrementer (see Section 6.3), which 
is used by system software to set interval timers 
(“alarms”). The update frequency chosen should 
be appropriate for this purpose as well. 


Engineering Note 


One method that can be used to meet the require- 
ment to synchronize Time Base values in all 
processors is to have a TB Enable input signal. 
When this signal is active, the Time Base is 
allowed to increment. When this signal is inac- 
tive, the Time Base does not increment. This 
signal may also be used to satisfy the require- 
ment either to prevent the Time Base from incre- 
menting or to prevent the Time Base from being 
read in problem state. 


If the TB Enable input signal is implemented, the 
Decrementer does not decrement when this signal 
is inactive. 


6.2.1 Writing the Time Base 


Writing the Time Base is privileged, and can be done 
only in hypervisor state. Reading the Time Base is 
not privileged; it is discussed in Book II, PowerPC AS 
Virtual Environment Architecture. 


It is not possible to write the entire 64-bit Time Base 
using a single instruction. The mttbl and mttbu 
extended mnemonics write the lower and upper 
halves of the Time Base (TBL and TBU), respectively, 
preserving the other half. These are extended mne- 
monics for the mtspr instruction; see page 84. 


The Time Base can be written by a sequence such as: 


# load 64-bit value for 
TB into Rx and Ry 


lwz Rx, upper 
lwz Ry,lomer + 


li Rz,0 

mttbl Rz t force TBL to 0 
mttbu Rx + set TBU 

mttbl Ry + set TBL 


Provided that no interrupts occur while the last three 
instructions are being executed, loading O into TBL 
prevents the possibility of a carry from TBL to TBU 
while the Time Base is being initialized. 
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Programming Note 


The instructions for writing the Time Base are 
mode-independent. Thus code written to set the 
Time Base will work correctly in either 64-bit or 
32-bit mode. 


6.3 Decrementer 


The Decrementer (DEC) is a 32-bit decrementing 
counter that provides a mechanism for causing a 
Decrementer interrupt after a programmable delay. 


DEC 
0 31 


Figure 30. Decrementer 


The Decrementer is driven by the same frequency as 
the Time Base. The period of the Decrementer will 
depend on the driving frequency, but if the same 
values are used as given above for the Time Base 
(see Section 6.2), and if the Time Base update fre- 
quency is constant, the period would be 


o 282x32 3 
Tec = 100 MHZ ~ 1.37 x 10º seconds 


which is approximately 23 minutes. 


The Decrementer counts down, causing an interrupt 
(unless masked) when passing through zero. The 
Decrementer must be implemented such that the fol- 
lowing requirements are satisfied. 


1. The operation of the Time Base and the 
Decrementer is coherent, i.e., the counters are 
driven by the same fundamental time base. 


2. Loading a GPR from the Decrementer shall have 
no effect on the accuracy of the Decrementer. 


3. Storing a GPR to the Decrementer shall replace 
the value in the Decrementer with the value in 
the GPR. 


4. Whenever bit O of the Decrementer changes from 
0 to 1, an interrupt request is signaled. If mul- 
tiple Decrementer interrupt requests are received 
before the first can be reported, only one inter- 
rupt is reported. The occurrence of a 
Decrementer interrupt cancels the request. 


5. If the Decrementer is altered by software and the 
contents of bit O are changed from O to 1, an 
interrupt request is signaled. 


Programming Note 


In systems that change the Time Base update fre- 
quency for purposes such as power management, 
the Decrementer input frequency will also change. 
Software must be aware of this in order to set 
interval timers. 


6.3.1 Writing and Reading the 
Decrementer 


The contents of the Decrementer can be read or 
written using the mfspr and mtspr instructions, both 
of which are privileged when they refer to the 
Decrementer. Using an extended mnemonic (see 
page 84), the Decrementer can be written from GPR 
Rx using: 


mtdec Rx 


Programming Note 


If the execution of the mtdec instruction causes 
bit O of the Decrementer to change from 0 to 1, an 
interrupt request is signaled. 


The Decrementer can be read into GPR Rx using: 
mídec Rx 


Copying the Decrementer to a GPR has no effect on 
the Decrementer contents or on the interrupt mech- 
anism. 
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Chapter 7. Synchronization Requirements for Special 
Registers and for Lookaside Buffers 


Changing the contents of certain System Registers 
and of SLB entries, and invalidating SLB and TLB 
entries, can have the side effect of altering the 
context in which data addresses and instruction 
addresses are interpreted, and in which instructions 
are executed and data accesses are performed. For 
example, changing MSR), from O to 1 has the side 
effect of enabling translation of instruction addresses. 
These side effects need not occur in program order, 
and therefore may require explicit synchronization by 
software. (Program order is defined in Book Ill, 
PowerPC AS Virtual Environment Architecture.) 


An instruction that alters the context in which data 
addresses or instruction addresses are interpreted, or 
in which instructions are executed or data accesses 
are performed, is called a context-altering instruction. 
This chapter covers all the  context-altering 
instructions. The software synchronization required 
for them is shown in Table 1 (for data access) and 
Table 2 (for instruction fetch and execution). 


The notation “CSI” in the tables means any context 
synchronizing instruction (i.e., sc, isync, or rfid). A 
context synchronizing interrupt (i.e. any interrupt 
except non-recoverable System Reset or non- 
recoverable Machine Check) can be used instead of a 
context synchronizing instruction. If it is, phrases like 
“the synchronizing instruction”, below, should be 
interpreted as meaning the instruction at which the 
interrupt occurs. If no software synchronization is 
required before (after) a context-altering instruction, 
“the synchronizing instruction before (after) the 
context-altering instruction” should be interpreted as 
meaning the context-altering instruction itself. 


The synchronizing instruction before the context- 
altering instruction ensures that all instructions up to 
and including that synchronizing instruction are 
fetched and executed in the context that existed 
before the alteration. The synchronizing instruction 
after the context-altering instruction ensures that all 
instructions after that synchronizing instruction are 
fetched and executed in the context established by 
the alteration. Instructions after the first synchro- 
nizing instruction, up to and including the second syn- 
chronizing instruction, may be fetched or executed in 
either context. 


If a sequence of instructions contains context-altering 
instructions and contains no instructions that are 
affected by any of the context alterations, no software 
synchronization is required within the sequence. 


Programming Note 


Sometimes advantage can be taken of the fact 
that certain instructions that occur naturally in the 
program, such as the rfid at the end of an inter- 
rupt handler, provide the required synchroniza- 
tion. 


No software synchronization is required before 
altering the MSR using mtmsr[d] (except perhaps 
when altering the LE bit; see the tables), because 
mimsr| d] is execution synchronizing. No software 
synchronization is required before most of the other 
alterations shown in Table 2, because all instructions 
before the context-altering instruction are fetched and 
decoded before the context-altering instruction is exe- 
cuted (the processor must determine whether any of 
the preceding instructions are context synchronizing). 
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Instruction or Required Required Notes 

Event Before After 

interrupt none none 

rfid none none 

sc none none 

Trap none none 

mimsrd (SF) none CSI 

mimsrl d] (ILE) none none 

mtmsrl d] (PR) none CSI 

mtmsrl d] (DR) none CSI 

mimsrl d] (LE) = = 1 

mtsr{ in] CSI CSI 

mtspr (ACCR) CSI CSI 

mtspr (SDR1) sync CSI 4,5 

mtspr (DABR) = — 3 

mtspr (EAR) CSI CSI 

slbie CSI CSI 

slbia CSI CSI 

slbmte CSI CSI 11 

tlbie CSI sync 6, 7 

tibia CSI sync 6 

Table 1. Synchronization requirements for data 
access 

Notes: 


1. 


Synchronization requirements for changing from 
one Endian mode to the other using the mtmsr{ d] 
instruction are implementation-dependent, and 
are specified in the Book IV, PowerPC AS Imple- 
mentation Features document for the implementa- 
tion. 


The effect of changing the EE bit is immediate. 


m |f an mtmsr{d] instruction sets the EE bit to 
0, neither an External interrupt nor a 
Decrementer interrupt occurs after the 
mtmsr{[ d] is executed. 


m |f an mtmsr[d] instruction changes the EE bit 
from 0 to 1 when an External, Decrementer, 
or higher priority exception exists, the corre- 
sponding interrupt occurs immediately after 
the mtmsr[d] is executed, and before the 
next instruction is executed in the program 
that set EE to 1. 


Synchronization requirements for changing the 
Data Address Breakpoint Register are implemen- 
tation-dependent, and are specified in the Book 
IV, PowerPC AS Implementation Features docu- 
ment for the implementation. 


Instruction or Required Required Notes 
Event Before After 

interrupt none none 

rfid none none 

sc none none 

Trap none none 

mimsrd (SF) none CSI 8 
mimsrl d] (ILE) none none 

mimsrl d] (EE) none none 2 
mtmsrl d] (PR) none CSI 9 
mtmsril d] (FP) none CSI 
mimsr| d] (FEO,FE1) none CSI 

mtmsrl d] (SE, BE) |none CSI 

mtmsri d] (IR) none CSI 9 
mimsrl d] (RI) none none 

mimsrl d] (LE) — — 1 
mtsr{ in] none CSI 9 
mtspr (SDR1) sync CSI 4,5 
mtspr (DEC) none none 10 
mtspr (CTRL) none none 

slbie none CSI 

slbia none CSI 

slbmte none CSI 9, 11 
tlbie none CSI or sync |6, 7 
tibia none CSI or sync |6 


Table 2. Synchronization requirements for instruction 


fetch and/or execution 


4. SDR1 must not be altered when MSRpp= 1 
MSRjp=1; if it is, the results are undefined. 


or 


Architecture Note 


Altering SDR1 when MSRp=1 is prohibited 
because synchronizing Reference bit updates 
and instruction fetches, based on the old and 
new contents of SDR1, would be complex for 
both hardware and software. Altering SDR1 
when MSRpp=1 is prohibited because the 
capability is deemed not to be sufficiently 
useful to software to warrant verifying it. 


For most operating systems, SDR1 is 
expected not to be altered after it has been 
initialized. Therefore there is no need to 


support altering it quickly. 


5. A sync instruction is required before the mtspr 


instruction because SDR1 identifies the Page 
Table and thereby the location of Reference and 
Change bits. To ensure that Reference and 
Change bits are updated in the correct Page 
Table, SDR1 must not be altered until all Refer- 
ence and Change bit updates associated with 
address translations that were performed, by the 
processor executing the mtspr instruction, before 
the mtspr instruction is executed have been per- 
formed with respect to that processor. A sync 
instruction guarantees this synchronization of 
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Reference and Change bit updates, while neither 
a context synchronizing operation nor the instruc- 
tion fetching mechanism does so. 


Architecture Note 


The reasoning given above might suggest 
that the architecture should require a sync 
instruction before alteration of MSRgp and 
SLB entries (including alteration of SLB 
entries caused by mtsr[in] instructions; see 
Section 9.1.2), because they can affect the 
mapping of effective addresses to virtual 
addresses and hence can affect which PTE a 
given effective address maps to. But it is 
unlikely that any implementation would use 
MSRsf or an SLB entry more than once per 
storage access, whereas some implementa- 
tions might use SDR1 twice — once to deter- 
mine the real address for the access and 
once to set the Reference and Change bits. 
Therefore the architecture does not require a 
sync instruction before alteration of MSRg- 
and SLB entries. 


(An implementation might use SDR1 twice per 
storage access by keeping in the store queue 
entry for the Reference and Change bit 
update only the offset in the Page Table of 
the corresponding PTE, instead of the PTE's 
real address, and then adding the Page Table 
address from SDR1 to the offset to determine 
where to update the Reference and Change 
bits.) 


6. For data accesses, the context synchronizing 


instruction before the tlbie or tlbia instruction 
ensures that all preceding instructions that 
access data storage have completed to a point at 
which they have reported all exceptions they will 
cause. 


A context synchronizing instruction after the tlbie 
or tibia ensures that storage accesses associated 
with fetching instructions following the tlbie or 
tibia will not use the TLB entry(s) being invali- 
dated. A sync instruction (or a context synchro- 
nizing instruction) after the tlbie or tibia has the 
corresponding effect for data accesses. A sync 
instruction also ensures that all storage accesses 
associated with instructions preceding the sync 
instruction, and all Reference and Change bit 
updates associated with address translations that 
were performed, by the processor executing the 
sync instruction, before the sync instruction is 
executed, will be performed with respect to any 
processor or mechanism, to the extent required 
by the associated Memory Coherence Required 
attributes, before any data accesses caused by 
instructions following the sync instruction are per- 
formed with respect to that processor or mech- 


anism. If effects described in both the first and 
the third sentences of this paragraph are needed, 
both a context synchronizing instruction and a 
sync instruction must be used. 


Section 4.11, “Page Table Update Synchronization 
Requirements” on page 49 gives examples of the 
synchronization required when using tlbie in a 
sequence that alters a Page Table Entry. 


Programming Note 


The following sequence illustrates why it is 
necessary to ensure that all instructions that 
precede the tlbie or tlbia and access data 
storage have completed to a point at which 
they have reported all exceptions they will 
cause. Assume that valid SLB and Page 
Table entries exist for the target storage 
location when the sequence starts. 


1. A program issues a Load or Store 
instruction to a page. 

2. The same program marks the entry for 
the target page invalid in the Page Table. 

3. The same program executes a tlbie or 
tibia that invalidates the corresponding 
TLB entry. 

4. The Load or Store instruction finally exe- 
cutes, and gets a page fault. 


The page fault is semantically incorrect. In 
order to prevent it, a context synchronizing 
instruction must be executed between steps 1 
and 2. 


7. Multiprocessor systems have additional require- 


ments to synchronize “TLB shoot down” (i.e., to 
invalidate one or more TLB entries on all 
processors in the multiprocessor system and be 
able to ensure that the invalidations will have 
completed and that all side effects of the invali- 
dations will have taken effect before any data 
accesses caused by subsequent instructions are 
performed); see Section 4.11.1, “Page Table 
Updates” on page 49. 


. The alteration must not cause an implicit branch 


in effective address space. Thus the mtmsrd 
instruction and all subsequent instructions, up to 
and including the next context synchronizing 
instruction, must have effective addresses that 
are less than 232. 


. The alteration must not cause an implicit branch 


in real address space. Thus the real address of 
the context-altering instruction and of each sub- 
sequent instruction, up to and including the next 
context synchronizing instruction, must be inde- 
pendent of whether the alteration has taken 
effect. 
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10. The elapsed time between the contents of the 


11. 


Decrementer becoming negative and the sig- 
naling of the Decrementer exception is not 
defined. 


If an sibmte instruction alters the mapping, or 
associated attributes, of a currently mapped 
ESID, the s/bmte must be preceded by an sibie 
(or sibia) instruction that invalidates the existing 
translation. This applies even if the corre- 
sponding entry is no longer in the SLB (the trans- 
lation may still be in implementation-specific 
address translation lookaside information). No 


software synchronization is needed between the 
slbie and the slbmte, regardless of whether the 
index of the SLB entry (if any) containing the 
current translation is the same as the SLB index 
specified by the s/bmte. 


No slbie (or slbia) is needed if the s/bmte instruc- 
tion replaces a valid SLB entry with a mapping of 
a different ESID (e.g., to satisfy an SLB miss). 
However, the slbie is needed later if and when 
the translation that was contained in the replaced 
SLB entry is to be invalidated. 
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Chapter 8. Optional Facilities and Instructions 


8.1 External Control ........... 15 8.3 Real Mode Storage Control .... 77 
8.1.1 External Access Register ..... 75 8.4 Move to Machine State Register 
8.1.2 External Access Instructions ... 75 Instruction: ii rs ai ee a E mate E 78 
8.2 Data Address Breakpoint ...... 76 
The facilities and instructions described in this fel m» I[RID| 
chapter are optional. An implementation may provide 0 oa 
all, some, or none of them. 
Bit(s) Name Description 
0 E Enable bit 
26:31 RID Resource ID 


8.1 External Control 


The External Control facility permits a program to 
communicate with a special-purpose device. The 
facility consists of a Special Purpose Register, called 
EAR, and two instructions, called External Control In 
Word Indexed (eciwx) and External Control Out Word 
Indexed (ecowx). 


This facility must provide a means of synchronizing 
the devices with the processor to prevent the use of 
an address by the device when the translation that 
produced that address is being invalidated. 


Engineering Note 


Synchronization of devices with respect to tlbie 
instructions previously executed by the processor 
can be provided by tlbsync. 


8.1.1 External Access Register 


This 32-bit Special Purpose Register controls access 
to the External Control facility and, for external 
control operations that are permitted, identifies the 
target device. 


All other fields are reserved. 
Figure 31. External Access Register 


The EAR is a hypervisor resource; see Section 1.7, 
“Logical Partitioning (LPAR)” on page 4. 


The high-order bits of the RID field that correspond to 
bits of the Resource ID beyond the width of the 
Resource ID supported by the implementation are 
treated as reserved bits. 


Programming Note 


The hypervisor can use the EAR to control which 
programs are allowed to execute External Access 
instructions, when they are allowed to do so, and 
which devices they are allowed to communicate 
with using these instructions. 


8.1.2 External Access Instructions 


The External Access instructions, External Control In 
Word Indexed (eciwx) and External Control Out Word 
Indexed (ecowx), are described in Book Il, PowerPC 
AS Virtual Environment Architecture. Additional infor- 
mation about them is given below. 


If attempt is made to execute either of these 
instructions when EAR-=0, a Data Storage interrupt 
occurs with bit 11 of the DSISR set to 1. 


The instructions are supported whenever MSRpp=1. 
If either instruction is executed when MSRpp= 0 (real 
addressing mode), the results are boundedly unde- 
fined. 
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8.2 Data Address Breakpoint 


The Data Address Breakpoint facility provides a 
means of detecting load and store accesses to a des- 
ignated doubleword. The address comparison is done 
on an effective address, and is done independent of 
whether address translation is enabled or disabled. 


The Data Address Breakpoint facility is controlled by 
the Data Address Breakpoint Register (DABR). 


DAB BTIDWDR 
0 61 63 


Bit(s) Name Description 


0:60 DAB Data Address Breakpoint 

61 BT Breakpoint Translation Enable 
62 DW Data Write Enable 

63 DR Data Read Enable 


Figure 32. Data Address Breakpoint Register 


The DABR is a hypervisor resource; see Section 1.7, 
“Logical Partitioning (LPAR)” on page 4. 


A Data Address Breakpoint match occurs for a Load 
or Store instruction if, for any byte accessed, 


| EAo:60=DABR pap; and 

= MSRpp=DABR py, and 

m the instruction is a Store and DABRpw=1, or the 
instruction is a Load and DABRpp=1. 


In 32-bit mode the high-order 32 bits of the EA are 
treated as zeros for the purpose of detecting a match. 


If the above conditions are satisfied, a match also 
occurs for eciwx and ecowx. For the purpose of 
determining whether a match occurs, eciwx is treated 
as a Load, and ecowx is treated as a Store. 


If the above conditions are satisfied, it is undefined 
whether a match occurs in the following cases. 


m The instruction is Store Conditional but the store 
is not performed. 

m The instruction is a Load/Store String of zero 
length. 

m The instruction is dcbz. (For the purpose of 
determining whether a match occurs, dcbz is 
treated as a Store.) 


The Cache Management instructions other than debz 
never cause a match. 


A Data Address Breakpoint match causes a Data 
Storage exception (see Section 5.5.3, “Data Storage 
Interrupt” on page 56). If a match occurs, some or all 
of the bytes of the storage operand may have been 
accessed; however, if a Store or ecowx instruction 
causes the match, the storage operand is not altered 
if the instruction is one of the following: 


m any Store instruction that causes an atomic 
access 
m ecowx 


Programming Note 


The Data Address Breakpoint facility does not 
apply to instruction fetches. 


If a Data Address Breakpoint match occurs for a 
Load instruction for which any byte of the storage 
operand is in storage that is both Caching Inhib- 
ited and Guarded, or for an eciwx instruction, it 
may not be safe for software to restart the 
instruction. 


Engineering Note 


In the case of a DABR match, it is preferable not 
to access or alter any bytes of the storage 
operand at or after the breakpoint address. This 
makes the Data Address Breakpoint facility more 
useful for debugging. 


76 PowerPC AS Operating Environment Architecture, Tags Inactive Subset 


IBM Confidential - Feb. 24, 1999 


8.3 Real Mode Storage Control 


The Real Mode Storage Control facility provides a 
means of specifying portions of real storage that are 
treated as non-Guarded in real addressing mode 
(MSR;p=0 or MSRpa=0, as appropriate for the type 
of access). The remaining portions are treated as 
Guarded in real addressing mode (as is all of storage 
on implementations that do not provide this means). 
The means is a hypervisor resource (see Section 1.7, 
“Logical Partitioning (LPAR)” on page 4), and may 
also be system-specific. 


The facility does not apply to implicit accesses to the 
Page Table by the hardware in performing address 
translation or recording reference and change infor- 
mation. These accesses are performed as described 
in Section 4.2.6, “Real Addressing Mode” on page 26. 


Programming Note 


The preceding capability can be used to improve 
the performance of software that runs in real 
addressing mode, by causing accesses to 
instructions and data that occupy well-behaved 
storage to be treated as non-Guarded. Because 
in real addressing mode all storage is not Caching 
Inhibited, software should not map a Caching 
Inhibited virtual page to storage that is treated as 
non-Guarded in real addressing mode. Doing so 
could permit storage locations in the virtual page 
to be copied into the cache, which could lead to 
violations of the requirement given in Section 
4.7.2 for changing the value of the | bit. 


Engineering Note 


The preceding capability should be provided at 
sufficiently fine granularity that the operating 
system can specify that kernel space (code and 
data) is treated as non-Guarded in real 
addressing mode and can map all application 
space to real storage that is treated as Guarded 
in real addressing mode. (This is necessary in 
order to prevent application code or data from 
being fetched into a cache when address trans- 
lation is disabled.) A simple way to provide the 
capability is to use an implementation-specific 
SPR to specify the boundary between non- 
Guarded and Guarded real storage; any address 
below the value contained in the SPR would be 
treated as non-Guarded, and any address above 
the value (inclusive) would be treated as Guarded. 
It is permissible to treat any boundaries thus pro- 
vided as if they were protection boundaries, with 
respect to causing an Alignment interrupt if an 
access crosses a boundary. 
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8.4 Move to Machine State Register Instruction 


Move To Machine State Register X-form 


mimsr RS 


31 RS WI WI 146 / 
0 6 11 16 21 31 


MSRa2:50 52:57 60:63 € (RS) 32:50 52:57 60:63 


The result of ORing bits 58 and 49 of register RS is 
placed into MSRsg. The result of ORing bits 59 and 49 
of register RS is placed into MSRso. Bits 32:50, 52:57, 
and 60:63 of register RS are placed into the corre- 
sponding bits of the MSR. The high-order 32 bits of 
the MSR are unchanged. 


RS 
RS 


This instruction is privileged. This instruction is exe- 
cution synchronizing except with respect to alter- 
ations to the LE bit; see Chapter 7, “Synchronization 
Requirements for Special Registers and for Lookaside 
Buffers” on page 71. 


In addition, alterations to the EE and RI bits are effec- 
tive as soon as the instruction completes. Thus if 
MSRee=0 and an External or Decrementer interrupt 
is pending, executing an mtmsr instruction that sets 
MSRe¢ to 1 will cause the External or Decrementer 
interrupt to be taken before the next instruction is 
executed, if no higher priority exception exists (see 
Section 5.8, “Interrupt Priorities” on page 65). 


Special Registers Altered: 
MSR 


Programming Note 


If this instruction sets MSRpp to 1, it also sets 
MSR ig and MSRpp to 1. 


This instruction does not alter MSRye. (This 
instruction does not alter MSRyy because it does 
not alter any of the high-order 32 bits of the 
MSR.) 


Programming Note 


For a discussion of software synchronization 
requirements when altering certain MSR bits, see 
Chapter 7. 


Programming Note 


There is no need for an analogous version of the 
mfmsr instruction, because the existing instruction 
copies the entire contents of the MSR to the 
selected GPR. 
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Chapter 9. Optional Facilities and Instructions that are being 


Phased Out of the Architecture 


9.1 Bridge to SLB Architecture 
9.1.1 Address Space Register 


The facilities and instructions described in this 
chapter are optional. An implementation may provide 
all, some, or none of them. 


Warning: These facilities and instructions are being 
phased out of the architecture. 


The facilities and instructions described in this 
chapter are generally not mentioned elsewhere in 
Books | — Ill. Any conflict between this chapter and 
other parts of the Books is deemed to be resolved in 
favor of this chapter. 


9.1 Bridge to SLB Architecture 


The facility described in this section can be used to 
ease the transition to the current PowerPC AS 
software-managed Segment Lookaside Buffer (SLB) 
architecture, from either the Segment Register archi- 
tecture provided by 32-bit PowerPC implementations 
or the hardware-accessed Segment Table architecture 
provided by 64-bit PowerPC implementations and by 
earlier PowerPC AS implementations. 


The facility permits the operating system to continue 
to use the 32-bit PowerPC implementation's Segment 
Register Manipulation instructions, and to continue to 
use the Address Space Register (ASR). 


Programming Note 


Warning: This facility is being phased out of the 
architecture. It is likely not to be supported on 
future implementations. New programs should 
not use it. 


9.1.2 Segment Register Manipulation 
Instructions 


Engineering Note 


Decisions regarding whether to implement this 
facility in a given implementation, and how well to 
make it perform there, must include consideration 
of migration plans for existing software that uses 
it. 


9.1.1 Address Space Register 


The ASR is a 64-bit Special Purpose Register pro- 
vided for operating system use. 


ASR 
0 63 


Figure 33. Address Space Register 


Programming Note 


The ASR can be used to point to a Segment 
Table. 


On earlier PowerPC AS implementations and on 
64-bit PowerPC implementations, bits 0:51 of the 
ASR contained the high-order 52 bits of the 64-bit 
real address of the Segment Table, and bit 63 of 
the ASR indicated whether the specified Segment 
Table should (bit 63 = 1) or should not (bit 63 = 
0) be searched by the processor when doing 
address translation. 
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9.1.2 Segment Register Manipulation Instructions 


The instructions described in this section — misr, 
mtsrin, mfsr, and mfsrin — allow software to associate 
effective segments 0 through 15 with any of virtual 
segments 0 through 22” 1. SLB entries 0:15 serve as 
virtual Segment Registers, with SLB entry i used to 
emulate Segment Register i. The misr and misrin 
instructions move 32 bits from a selected GPR to a 
selected SLB entry. The mfsr and mfsrin instructions 
move 32 bits from a selected SLB entry to a selected 
GPR. 


The contents of the GPRs used by the instructions 
described in this section are shown in Figure 34. 
Fields shown as zeros must be zero for the Move To 
Segment Register instructions. Fields shown as 
hyphens are ignored. Fields shown as periods are 
ignored by the Move To Segment Register 
instructions and set to zero by the Move From 
Segment Register instructions. Fields shown as 
colons are ignored by the Move To Segment Register 
instructions and set to undefined values by the Move 
From Segment Register instructions. 


RS/RT 

0 31 33 35 37 63 
RB 

[ees Jew] ] | Esio | 
0 32 35 63 


Figure 34. GPR contents for mtsr, misrin, mfsr, and 
mfsrin 


Programming Note 


The “Segment Register” format used by the 
instructions described in this section corresponds 
to the low-order 32 bits of RS and RT shown in 
the figure. This format is essentially the same as 
that for the Segment Registers of 32-bit PowerPC 
implementations. The only differences are the fol- 
lowing. 


m Bit 36 corresponds to a reserved bit in 
Segment Registers. Software must supply 0 
for the bit because it corresponds to the L bit 
in SLB entries, and large pages are not sup- 
ported for SLB entries created by the Move 
To Segment Register instructions. 


m VSID bits 25:27 correspond to reserved bits in 
Segment Registers. Software can use these 
extra VSID bits to create VSIDs that are 
larger than those supported by the Segment 
Register Manipulation instructions of 32-bit 
PowerPC implementations. 


Bit 32 of RS and RT corresponds to the T (direct- 
store) bit of early 32-bit PowerPC implementa- 
tions. No corresponding bit exists in SLB entries. 


Programming Note 


The Programming Note in the introduction to 
Section 4.10.2.1, “SLB Management Instructions” 
on page 42 applies also to the Segment Register 
Manipulation instructions described in this 
section, and to any combination of the instructions 
described in the two sections, except as specified 
below for mfsr and mfsrin. 


The requirement that the SLB contain at most one 
entry that translates a given effective address 
(see Section 4.4.1, “Segment Lookaside Buffer 
(SLB)” on page 29) applies to SLB entries created 
by mtsr and mtsrin. This requirement is satisfied 
naturally if only mtsr and mtsrin are used to 
create SLB entries for a given ESID, because for 
these instructions the association between SLB 
entries and ESID values is fixed (SLB entry i is 
used for ESID i). However, care must be taken if 
slbmie is also used to create SLB entries for the 
ESID, because for slbmte the association between 
SLB entries and ESID values is specified by soft- 
ware. 
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Move To Segment Register X-form 


mtsr SR,RS 


31 RS |/| SR MH 210 / 
0 6 11] 12 16 21 31 


The SLB entry specified by SR is loaded from register 
RS, as follows. 


SLBE Bit(s) Set to SLB Field(s) 


0:31 0x0000 0000 ESIDos: 

32:35 SR ESID3 35 

36 Obi 

37:61 0x00 0000 |0b0  VSIDoz4 

62:88 (RS)37:63 VSID 25:51 

89:91 (RS) 39.96 N 

92 (RS) L ((RS)3g must be 0b0) 
93 0b0 C 


MSRsp must be O when this instruction is executed; 
otherwise the results are boundedly undefined. 


This instruction is privileged. 


Special Registers Altered: 
None 


Architecture Note 


The requirement that the Segment Register 
Manipulation instructions be executed only in 
32-bit mode permits normal EA computation (in 
which the high-order 32 bits of the result are 
treated as zeros in 32-bit mode but not in 64-bit 
mode) to be used for mtsrin and mfsrin. 


Move To Segment Register Indirect 
X-form 


mtsrin RS,RB 
[POWER mnemonic: mtsri] 


The SLB entry specified by (RB)s5.35 is loaded from 
register RS, as follows. 


SLBE Bit(s) Set to SLB Field(s) 


0:31 0x0000_0000 ESIDo.34 

32:35 (RB)a2:35 ESID32:35 

36 0b1 V 

37:61 0x00_0000|0b0  VSIDoz4 

62:88 (RS)s7:63 VSID25:51 

89:91 (RS) 39.96 KKN 

92 (RS) 5 L ((RS)gg must be 0b0) 
93 0b0 C 


MSRsp must be O when this instruction is executed; 
otherwise the results are boundedly undefined. 


This instruction is privileged. 


Special Registers Altered: 
None 
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Move From Segment Register X-form 


mfsr RT,SR 


31 RT |/| SR VII 595 / 
0 6 11] 12 16 21 31 


The contents of the low-order 27 bits of the VSID field, 
and the contents of the Ks, K,, N, and L fields, of the 
SLB entry specified by SR are placed into register RT, 
as follows. 


SLBE Bit(s) Copied to SLB Field(s) 


89:91 RT 33-35 K,K,N 
92 RT 36 L (SLBE, must be 0b0) 


RTapis set to 0. The contents of RT9.3, are undefined. 


MSRsp must be O when this instruction is executed; 
otherwise the results are boundedly undefined. 


This instruction must be used only to read an SLB 
entry that was, or could have been, created by misr 
or mtsrin and has not subsequently been invalidated 
(i.e., an SLB entry in which ESID< 16, V=1, VSID< 227, 
L=0, and C=0). Otherwise the contents of register 
RT are undefined. 


This instruction is privileged. 


Special Registers Altered: 
None 


Move From Segment Register Indirect 
X-form 


mfsrin RT,RB 


31 RT WII RB 659 / 
0 6 11 16 21 31 


The contents of the low-order 27 bits of the VSID field, 
and the contents of the Ks, Kp, N, and L fields, of the 
SLB entry specified by (RB)30.35 are placed into reg- 
ister RT, as follows. 


SLBE Bit(s) Copied to SLB Field(s) 


89:91 RT 33:35 
92 RT 36 L (SLBE, must be 0b0) 


RTs% is set to 0. The contents of RT9.3, are undefined. 


MSRs must be O when this instruction is executed; 
otherwise the results are boundedly undefined. 


This instruction must be used only to read an SLB 
entry that was, or could have been, created by misr 
or mtsrin and has not subsequently been invalidated 
(i.e., an SLB entry in which ESID< 16, V=1, VSID< 227, 
L=0, and C=0). Otherwise the contents of register 
RT are undefined. 


This instruction is privileged. 


Special Registers Altered: 
None 
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Appendix A. Assembler Extended Mnemonics 


In order to make assembler language programs simpler to write and easier to understand, a set of extended 
mnemonics and symbols is provided for certain instructions. This appendix defines extended mnemonics and 
symbols related to instructions defined in Book III. 


Assemblers should provide the extended mnemonics and symbols listed here, and may provide others. 


Appendix A. Assembler Extended Mnemonics 83 


IBM Confidential - Feb. 24, 1999 


A.1 Move To/From Special Purpose Register Mnemonics 


This section defines extended mnemonics for the mtspr and mfspr instructions, including the Special Purpose Reg- 
isters (SPRs) defined in Book | and certain privileged SPRs, and for the Move From Time Base instruction defined 
in Book Il. 


The mtspr and mfspr instructions specify an SPR as a numeric operand; extended mnemonics are provided that 
represent the SPR in the mnemonic rather than requiring it to be coded as an operand. Similar extended mne- 
monics are provided for the Move From Time Base instruction, which specifies the portion of the Time Base as a 
numeric operand. 


Note: mftb serves as both a basic and an extended mnemonic. The Assembler will recognize an mftb mnemonic 
with two operands as the basic form, and an mftb mnemonic with one operand as the extended form. In the 
extended form the TBR operand is omitted and assumed to be 268 (the value that corresponds to TB). 


Table 3 (Page 1 of 2). Extended mnemonics for moving to/from an SPR 
Move To SPR Move From SPR! 
Extended Equivalent to Extended Equivalent to 


Special Purpose Register 


Fixed-Point Exception 


Register mtxer Rx mtspr 1,Rx mfxer Rx mfspr Rx,1 


Link Register mtlr Rx mtspr 8,Rx mflr Rx mfspr Rx,8 


Count Register mtspr 9,Rx mfctr Rx mfspr Rx,9 


Data Storage Interrupt 


Status Register mtdsisr Rx mtspr 18,Rx mfdsisr Rx mfspr Rx,18 
Data Address Register mtdar Rx mtspr 19,Rx mfdar Rx mfspr Rx,19 
Decrementer mtdec Rx mtspr 22,Rx mfdec Rx mfspr Rx,22 
ia A al mispr 25,Rx mfsdri Rx mfspr Rx,25 
Save/Restore Register 0 mtsrrO Rx mtspr 26,Rx mísrrO Rx mfspr Rx,26 
Save/Restore Register 1 mtsrri Rx mtspr 27,Rx mfsrri Rx mfspr Rx,27 
ACCR mtaccr Rx mtspr 29,Rx mfaccr Rx mfspr Rx,29 


CTRL mtctrl Rx mtspr 152,Rx mfctrl Rx mfspr Rx,136 


Special Purpose Registers 


GO through G3 mtsprg n,Rx mtspr 272+n,Rx mfsprg Rx,n mfspr Rx,272+n 
Time Base [Lower] mttbl Rx mtspr 284,Rx mftb Rx mftb Rx,268 
Time Base Upper mttbu Rx mtspr 285,Rx mftbu Rx mftb Rx,269 
Processor Version Register = mfpvr Rx mfspr Rx,287 
MMCRA mtmmcra Rx mtspr 786,Rx mfmmcra Rx mfspr Rx,770 
PMC1 mtpmci Rx mtspr 787,Rx mfpmci Rx mfspr Rx,771 
PMC2 mtpmc2 Rx mtspr 788,Rx mfpmc2 Rx mfspr Rx,772 
mtspr 789,Rx mfpmc3 Rx mfspr Rx,773 

PMC4 mtpmc4 Rx mtspr 790,Rx mfpmc4 Rx mfspr Rx,774 
PMC5 mtpmc5 Rx mtspr 791,Rx mfpmc5 Rx mfspr Rx,775 
PMC6 mtpmc6 Rx mtspr 792,Rx mfpmc6 Rx mfspr Rx,776 
mtspr 793,Rx mfpmc7 Rx mfspr Rx,777 
PMC8 mtpmc8 Rx mtspr 794,Rx mfpmc8 Rx mfspr Rx,778 

| MMCRO | mtmmcrO Rx | mtspr 795,Rx mîmmcro Rx mfspr Rx,779 


84 PowerPC AS Operating Environment Architecture, Tags Inactive Subset 


IBM Confidential - Feb. 24, 1999 


Table 3 (Page 2 of 2). Extended mnemonics for moving to/from an SPR 


é . Move To SPR Move From SPR! 
Special Purpose Register 
Extended Equivalent to Extended Equivalent to 


MMCRI mtmmer1 Rx mtspr 798,Rx mfmmer1 Rx mfspr Rx,782 


Processor Identification 


Register = = mfpir Rx mfspr Rx,1023 


'Except for mftb and mftbu. 


Programming Note 


The extended mnemonics in Table 3 for SPRs 
associated with the Performance Monitor facility 
are based on the definitions in Appendix E. 


Other versions of Performance Monitor facilities 
used different sets of SPR numbers (all 32-bit 
PowerPC processors used a different set, and 
some early PowerPC AS processors used yet a 
different set). 
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Appendix B. Cross-Reference for Changed POWER 


Mnemonics 


The following table lists the POWER instruction mne- 
monics that have been changed in the PowerPC AS 
Operating Environment Architecture, sorted by 
POWER mnemonic. 


To determine the PowerPC AS mnemonic for one of 
these POWER mnemonics, find the POWER mnemonic 
in the second column of the table: the remainder of 


the line gives the PowerPC AS mnemonic and the 
page on which the instruction is described, as well as 
the instruction names. 


POWER mnemonics that have not changed are not 
listed. POWER instruction names that are the same in 
PowerPC AS are not repeated: i.e., for these, the last 
column of the table is blank. 


POWER 
Page 


PowerPC AS 


Mnemonic Instruction 


Mnemonic Instruction 


81 |mtsri Move To Segment Register Indirect 
11 Isvca Supervisor Call 
47 |tlbi TLB Invalidate Entry 


mtsrin 
sc System Call 
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Appendix C. New Instructions 


The following instructions in the PowerPC AS Oper- 
ating Environment Architecture are new; they are not 
in the POWER Architecture. 


The following instructions are optional: tibia, tlbsync, 
mtmsr. In addition the following instructions may 
optionally be provided as part of a “bridge” facility as 
described in Section 9.1, “Bridge to SLB Architecture” 
on page 79: mfsr, mfsrin, mtsr, mtsrin. 


mfsrin Move From Segment Register Indirect 

mtmsrd Move To Machine State Register 
Doubleword 

rfid Return From Interrupt Doubleword 

slbia SLB Invalidate All 

slbie SLB Invalidate Entry 


slbmfee SLB Move From Entry ESID 
slbmfev SLB Move From Entry VSID 


slbmte SLB Move To Entry 
tibia TLB Invalidate All 
tlbsync TLB Synchronize 
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Appendix D. Interpretation of the DSISR as Set by an 


Alignment Interrupt 


For most causes of Alignment interrupt, the interrupt 
handler will emulate the interrupting instruction. To 
do this, it needs the following characteristics of the 
interrupting instruction: 


Load or store 

Length (halfword, word, or doubleword) 
String, multiple, or elementary 
Fixed-point or floating-point 

Update or non-update 

Byte reverse or not 

Is it debz? 


The PowerPC AS Architecture optionally provides this 
information by setting bits in the DSISR that identify 
the interrupting instruction type. It is not necessary 
for the interrupt handler to load the interrupting 
instruction from storage. The mapping is unique 
except for a few exceptions that are discussed below. 
The near-uniqueness depends on the fact that many 
instructions, such as the fixed- and floating-point 
arithmetic instructions and the one-byte loads and 
stores, cannot cause an Alignment interrupt. 


See Section 5.5.8, “Alignment Interrupt” on page 59 
for a description of how the opcode and extended 
opcode are mapped to a DSISR value for an X-, D-, or 
DS-form instruction that causes an Alignment inter- 
rupt. 


The table on the next page shows the inverse 
mapping: how the DSISR bits identify the interrupting 
instruction. The following notes are cited in the table. 


(1) The instructions Iwz and Iwarx give the same 
DSISR bits (all zero). But if Iwarx causes an 
Alignment interrupt, it should not be emulated. It 
is adequate for the Alignment interrupt handler 
simply to treat the instruction as if it were Iwz. 
The emulator must use the address in the DAR, 
rather than compute it from RA/RB/D, because 
Iwz and Iwarx have different instruction formats. 


If opcode O (“lllegal or Reserved”) can cause an 
Alignment interrupt, it will be indistinguishable to 
the interrupt handler from Iwarx and Iwz. 


(2) These are distinguished by DSISR bits 12:13, which 
are not shown in the table. 


The interrupt handler has no need to distinguish 
between an X-form instruction and the corresponding 
D- or DS-form instruction if one exists, and vice versa. 
Therefore two such instructions may yield the same 
DSISR value (all 32 bits). For example, stw and stwx 
may both yield either the DSISR value shown in the 
following table for stw, or that shown for stwx. 
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then it is or then it is or 
either D/DS- either D/DS- 

If DSISR | X-form form If DSISR | X-form form 
15:21 is: |opcode: opcode:|so the instruction is: 15:21 is: |opcode: opcode:|so the instruction is: 
00 O 0000 | 00000xxx00 | x00000 |Iwarx, Iwz, reserved 10 O 0000 |00000xxx10 

(1) 10 O 0001 | 00010xxx10 - 
00 O 0001 | 00010xxx00 | x00010 | Idarx 10 0 0010 | 00100xxx10 stwex. 
00 O 0010 | 00100xxx00 | x00100 | stw 10 O 0011 | 00110xxx10 stdex. 
00 0 0011 | 00110xxx00 | x00110 |- 10 0 0100 | 01000xxx10 - 
00 O 0100 | 01000xxx00 | x01000 | Ihz 10 0 0101 | 01010xxx10 
00 O 0101 | 01010xxx00 | x01010 |lha 10 O 0110 | 01100xxx10 
00 O 0110 | 01100xxx00 | x01100 |sth 10 0 0111 | 01110xxx10 - 
00 O 0111 |01110xxx00 | x01110 |Imw 10 0 1000 | 10000xxx10 Iwbrx 
00 O 1000 | 10000xxx00 | x10000 |lfs 10 0 1001 | 10010xxx10 - 
00 O 1001 | 10010xxx00 | x10010 |lfd 10 0 1010 | 10100xxx10 stwbrx 
00 O 1010 | 10100xxx00 | x10100 |stfs 10 0 1011 | 10110xxx10 - 
00 O 1011 | 10110xxx00 | x10110 | stfd 10 0 1100 | 11000xxx10 Ihbrx 
00 0 1100 | 11000xxx00 | x11000 |- 10 0 1101 | 11010xxx10 - 
00 O 1101 | 11010xxx00 | x11010 |ld, Idu, Iwa (2) 10 0 1110 | 11100xxx10 sthbrx 
00 O 1110 | 11100xxx00 | x11100 |- 10 0 1111 | 11110xxx10 - 
00 0 1111 | 11110xxx00 | x11110 |std, stdu (2) 10 1 0000 | 00001xxx10 
00 1 0000 | 00001xxx00 | x00001 |lwzu 10 1 0001 | 00011xxx10 
00 1 0001 | 00011xxx00 | x00011 |- 10 1 0010 | 00101xxx10 
00 1 0010 | 00101xxx00 | x00101 |stwu 10 1 0011 | 00111xxx10 - 
00 1 0011 |00111xxx00 | x00111 |- 10 1 0100 | 01001xxx10 eciwx 
00 1 0100 | 01001xxx00 | x01001 |lhzu 10 1 0101 |01011xxx10 - 
00 1 0101 /01011xxx00 | x01011 |lhau 10 1 0110 /01101xxx10 ecowx 
00 1 0110 |01101xxx00 | x01101 |sthu 10 1 0111 |01111xxx10 - 
00 1 0111/01111xxx00 | x01111 |stmw 10 1 1000 | 10001xxx10 
00 1 1000 | 10001xxx00 | x10001 |lfsu 10 1 1001 | 10011xxx10 
00 1 1001 | 10011xxx00 | x10011 |lfdu 10 1 1010 | 10101xxx10 
00 1 1010 | 10101xxx00 | x10101 |stfsu 10 1 1011 | 10111xxx10 
00 1 1011 | 10111xxx00 | x10111 |stfdu 10 1 1100 | 11001xxx10 
00 1 1100 | 11001xxx00 | x11001 |- 10 1 1101 | 11011xxx10 
00 1 1101 | 11011xxx00 | x11011 |- 10 1 1110 | 11101xxx10 - 
00 1 1110 | 11101xxx00 | x11101 |- 10 11111 | 11111xxx10 dcbz 
00 1 1111 | 11111xxx00 | x11111 |- 11 O 0000 |00000xxx11 lwzx 
01 O 0000 | 00000xxx01 Idx 11 0 0001 | 00010xxx11 - 
01 0 0001 | 00010xxx01 - 11 0 0010 | 00100xxx11 stwx 
01 0 0010 | 00100xxx01 stdx 11 0 0011 | 00110xxx11 - 
01 0 0011 | 00110xxx01 - 11 O 0100 | 01000xxx11 lhzx 
01 0 0100 | 01000xxx01 - 11 0 0101 |01010xxx11 lhax 
01 O 0101 |01010xxx01 Iwax 11 0 0110 /01100xxx11 sthx 
01 0 0110 | 01100xxx01 - 11 00111/01110xxx11 - 
01 0 0111 /01110xxx01 - 11 0 1000 | 10000xxx11 lfsx 
01 O 1000 | 10000xxx01 Iswx 11 0 1001 | 10010xxx11 Ifdx 
01 0 1001 | 10010xxx01 Iswi 11 0 1010 | 10100xxx11 stfsx 
01 0 1010 | 10100xxx01 stswx 11 0 1011 | 10110xxx11 stfdx 
01 0 1011 | 10110xxx01 stswi 11 0 1100 | 11000xxx11 - 
01 0 1100 | 11000xxx01 - 11 0 1101 | 11010xxx11 
01 0 1101 | 11010xxx01 - 11 0 1110 | 11100xxx11 - 
01 0 1110 | 11100xxx01 - 11 0 1111 | 11110xxx11 stfiwx 
01 01111 | 11110xxx01 - 11 1 0000 | 00001xxx11 Iwzux 
01 1 0000 | 00001xxx01 Idux 11 1 0001 |00011xxx11 - 
01 1 0001 |00011xxx01 - 11 1 0010 | 00101xxx11 stwux 
01 1 0010 | 00101xxx01 stdux 11 1 0011 /00111xxx11 - 
01 1 0011 |00111xxx01 - 11 1 0100 | 01001xxx11 lhzux 
01 1 0100 | 01001xxx01 - 11 1 0101 |01011xxx11 lhaux 
01 1 0101 |01011xxx01 Iwaux 11 1 0110 /01101xxx11 sthux 
01 1 0110 |01101xxx01 - 11 1 0111 |01111xxx11 - 
01 1 0111 |01111xxx01 - 11 1 1000 | 10001xxx11 lfsux 
01 1 1000 | 10001xxx01 - 11 1 1001 | 10011xxx11 Ifdux 
01 1 1001 | 10011xxx01 - 11 1 1010 | 10101xxx11 stfsux 
01 1 1010 | 10101xxx01 - 11 1 1011 | 10111xxx11 stfdux 
01 1 1011 | 10111xxx01 - 11 1 1100 | 11001xxx11 - 
01 1 1100 | 11001xxx01 - 11 1 1101 | 11011xxx11 
01 1 1101 | 11011xxx01 - 11 11110 | 11101xxx11 
01 11110 | 11101xxx01 - 11 11111 | 11111xxx11 
01 11111 | 11111xxx01 - 
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Appendix E. Example Performance Monitor (Optional) 


A Performance Monitor facility provides a means of 
collecting information about program and system per- 
formance. 


The resources (e.g., SPR numbers) that a Perform- 
ance Monitor facility may use are identified elsewhere 
in this Book. All other aspects of any Performance 
Monitor facility are implementation-dependent, and 
are described in the Book IV, PowerPC AS Implemen- 
tation Features document for the implementation. 


This appendix provides an example of a Performance 
Monitor facility. It is only an example; implementa- 
tions may provide all, some, or none of the features 
described here, or may provide features that are 
similar to those described here but differ in detail. 


Programming Note 


Because the features provided by a Performance 
Monitor facility are implementation-dependent, 
operating systems should provide services that 
support the useful performance monitoring func- 
tions in a generic fashion. Application programs 
should use these services, and should not depend 
on the features provided by a particular imple- 
mentation. 


The example Performance Monitor facility consists of 
the following features (described in detail in subse- 
quent sections). 


m one MSR bit 


— PMM (Performance Monitor Mark), which can 
be used to select one or more programs for 
monitoring 


m SPRs 


— PMC1 —  PMC8 (Performance Monitor 
Counter registers 1 — 8), which count events 


— MMCRO, MMCRI, and MMCRA (Monitor 
Mode Control Registers 0, 1, and A), which 
control the Performance Monitor facility 


— SIAR and SDAR (Sampled Instruction 
Address Register and Sampled Data Address 
Register), which contain the address of the 
“sampled instruction” and of the “sampled 
data” 


m the Performance Monitor interrupt, which can be 
caused by monitored conditions and events 


The minimal subset of these features that makes the 
resulting Performance Monitor useful to applications 
consists of MSRpyy, PMC1, PMC2, PMC3, PMC4, 
MMCRO, MMCR1, and MMCRA and certain bits of 
these three Monitor Mode Control Registers. These 
features support the counting of four selected events, 
and are identified as the “basic” features below. The 
remaining features (the remaining SPRs, the 
remaining bits in the three Monitor Mode Control 
Registers, and the Performance Monitor interrupt) are 
considered “extensions”. 


The events that can be counted in the PMCs are 
implementation-dependent. The Book IV, PowerPC 
AS Implementation Features document for the imple- 
mentation describes the events that are available for 
each PMC, and also the code that identifies each 
event. The events and codes may vary between 
PMCs, as well as between implementations. The 
event to be counted in a given PMC is selected by 
specifying the appropriate code in the MMCR 
“Selector” field for the PMC. As described in Book IV, 
some events may include operations that are per- 
formed out-of-order. 


Many aspects of the operation of the Performance 
Monitor are summarized by the following hierarchy, 
which is described starting at the lowest level. 


m A “counter negative condition” occurs when the 
value in a PMC is negative (i.e., when bit 0 of the 
PMC is 1). A “Time Base transition event” occurs 
when a selected bit of the Time Base changes 
from 0 to 1 (the bit is selected by an MMCR field). 
The term “condition or event” is used as an 
abbreviation for “counter negative condition or 
Time Base transition event”. A condition or 
event can be caused implicitly by the processor 
(e.g., incrementing a PMC) or explicitly by soft- 
ware (mtspr). 


m A condition or event is enabled if the corre- 
sponding “Enable” bit in an MMCR is 1. The 
occurrence of an enabled condition or event can 
have side effects within the Performance Monitor, 
such as causing the PMCs to cease counting. 
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m An enabled condition or event causes a Perform- 
ance Monitor exception if Performance Monitor 
exceptions are enabled by the corresponding 
“Enable” bit in an MMCR. A single Performance 
Monitor exception may reflect multiple enabled 
conditions and events. 


m A Performance Monitor exception causes a Per- 
formance Monitor interrupt when MSRez=1. 


Programming Note 


The Performance Monitor can be effectively disa- 
bled (i.e., put into a state in which Performance 
Monitor SPRs are not altered and Performance 
Monitor interrupts do not occur) by setting 
MMCRO to 0x8000 0000. 


E.1 PMM Bit of the Machine 
State Register 


The Performance Monitor uses MSR bit PMM, which is 
defined as follows. 


Bit Description 
61 Performance Monitor Mark (PMM) 
This bit is a basic feature. 


This bit contains the Performance Monitor 
“mark” (0 or 1). 


If an mtmsr or mtmsrd instruction is executed that 
changes the value of the PMM bit, the change is not 
guaranteed to have taken effect until after a subse- 
quent context synchronizing instruction has been exe- 
cuted (see Chapter 7, “Synchronization Requirements 
for Special Registers and for Lookaside Buffers” on 
page 71). 


Programming Note 


Software can use this bit as a process-specific 
marker which, in conjunction with 
MMCROFcmo Fcm (see Section E.2.2), permits 
events to be counted on a process-specific basis. 
(The bit is saved by interrupts and restored by 
rfid.) 


Common uses of the PMM bit include the fol- 
lowing. 


m Count events for a few selected processes. 
This use requires the following bit settings. 


— MSRpym=1 for the selected processes, 
MSRpmm= O for all other processes 

— MMCROFcmo= 1 

— MMCROFcm= O 


m Count events for all but a few selected proc- 
esses. This use requires the following bit set- 
tings. 


— MSRpyy=1 for the selected processes, 
MSRpmm= O for all other processes 


— MMCROgoyo= 0 
— MMCROgoyi= 1 


Notice that for both of these uses a mark value of 
1 identifies the “few” processes and a mark value 
of 0 identifies the remaining “many” processes. 
Because the PMM bit is set to 0 when an interrupt 
occurs (see Figure 27 on page 54), interrupt han- 
dlers are treated as one of the “many”. If it is 
desired to treat interrupt handlers as one of the 
“few”, the mark value convention just described 
would be reversed. 


Architecture Note 


The two mark values (0 and 1) are equivalent 
except with respect to interrupts. That is, either 
mark value can be specified for a given process, 
and either mark value can control whether the 
PMCs are incremented, but interrupts always 
cause the mark value in the MSR to be set to 0 
(see Figure 27). 


Architecture Note 


No MSR bit is provided to disable the Perform- 
ance Monitor, because the Performance Monitor is 
considered a system-wide resource rather than a 
per-process resource. MMCRO can be used to 
achieve the effect of disabling the Performance 
Monitor, as described in the introduction to 
Appendix E. 
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E.2 Special Purpose Registers 


The Performance Monitor SPRs count events, control 
the operation of the Performance Monitor, and 
provide associated information. 


The Performance Monitor SPRs can be read and 
written using the mfspr and mtspr instructions (see 
Section 3.4.1, “Move To/From System Register 
Instructions” on page 16). The Performance Monitor 
SPR numbers are shown in Figure 35. Writing any of 
the Performance Monitor SPRs is privileged. Reading 
any of the Performance Monitor SPRs is not privileged 
(however, the privileged SPR numbers used to write 
the SPRs can also be used to read them; see the 
figure). 


The elapsed time between the execution of an instruc- 
tion and the time at which events due to that instruc- 
tion have been reflected in Performance Monitor SPRs 
is not defined. No means are provided by which soft- 
ware can ensure that all events due to preceding 
instructions have been reflected in Performance 
Monitor SPRs. Similarly, if the events being moni- 
tored may be caused by operations that are per- 
formed out-of-order, no means are provided by which 
software can prevent such events due to subsequent 
instructions from being reflected in Performance 
Monitor SPRs. Thus the value obtained by reading a 
Performance Monitor SPR may not be precise: it may 
fail to reflect some events due to instructions that 
precede the mfspr and may reflect some events due 
to instructions that follow the mfspr. This lack of pre- 
cision applies regardless of whether the state of the 
processor is such that the SPR is subject to change 
by the processor at the time the mfspr is executed. 


If an mtspr instruction is executed that changes the 
value of a Performance Monitor SPR other than SIAR 
or SDAR, the change is not guaranteed to have taken 
effect until after a subsequent context synchronizing 
instruction has been executed (see Chapter 7, “Syn- 
chronization Requirements for Special Registers and 
for Lookaside Buffers” on page 71). 


Programming Note 


Depending on the events being monitored, the 
contents of Performance Monitor SPRs may be 
affected by aspects of the runtime environment 
(e.g., cache contents) that are not directly attribut- 
able to the programs being monitored. 


SPR!2 
SPI5-9 SPro:4 
11000 n0010 
11000 n0011 
11000 n0100 
11000 n0101 
11000 n0110 
11000 n0111 
11000 n1000 
11000 n1001 
11000 n1010 
11000 n1011 
11000 n1100 
781,797 11000 n1101 
782,798 11000 n1110 


1 Note that the order of the two 5-bit halves 
of the SPR number is reversed. 

2 For mtspr, n must be 1. For mfspr, reading 
the SPR is privileged if and only if n=1. 


Register Privi- 
Name leged 


decimal 


770,786 
771,787 
772,788 
773,789 
774,790 
775,791 
776,792 
777,793 
778,794 
779,795 
780,796 


Figure 35. Performance Monitor SPR encodings for 
mtspr and mfspr 


E.2.1 Performance Monitor Counter 
Registers 
The eight Performance Monitor Counter registers, 


PMC1 through PMC8, are 32-bit registers that count 
events. 


31 


Figure 36. Performance Monitor Counter registers 
PMC1 and PMC2 are basic features. 


Normally each PMC is incremented each processor 
cycle by the number of times the corresponding event 
occurred in that cycle. Other modes of incrementing 
may also be provided (e.g., see the description of 
MMCRI bits PMC1HIST and PMCjHIST). 


“PMCj” is used as an abbreviation for “PMCi, i > 1”. 
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Programming Note 


Software can use a PMC to “pace” the collection 
of Performance Monitor data. For example, if it is 
desired to collect event counts every n cycles, 
software can specify that a particular PMC count 
cycles and set that PMC to 0x8000 0000 — n. The 
events of interest would be counted in other 
PMCs. The counter negative condition that will 
occur after n cycles can, with the appropriate 
setting of MMCR bits, cause counter values to 
become frozen, cause a Performance Monitor 


interrupt to occur, etc. 


Architecture Note 


Because they count events, the PMCs indirectly 
measure time and are therefore subject to the 
same requirements as the Time Base with respect 
to “covert channels” (see Section 6.2, “Time 
Base” on page 67). The requirements are satis- 


fied by MMCROrc (see Section E.2.2). 


Architecture Note 


The PMCs are numbered 1- 8, rather than 0- 7 
which would be more consistent with the num- 
bering in other register names, because early 
implementations of Performance Monitors num- 
bered them thus. 


E.2.2 Monitor Mode Control Register 0 


Monitor Mode Control Register 0 (MMCRO) is a 32-bit 
register. This register, along with MMCR1 and 
MMCRA, controls the operation of the Performance 
Monitor. 


| MMCRO | 


0 31 


Figure 37. Monitor Mode Control Register 0 


MMCRO is a basic feature. Within MMCRO, some of 
the bits and fields are basic features and some are 
extensions. The basic bits and fields are identified as 
such, below. 


Some bits of MMCRO are altered by the processor 
when various events occur, as described below. 


The bit definitions of MMCRO are as follows. MMCRO 
bits that are not implemented are treated as 
reserved. 


Bit(s) 
0 


A 


Description 
Freeze Counters (FC) 
This bit is a basic feature. 


0 The PMCs are incremented (if permitted 
by other MMCR bits). 
1 The PMCs are not incremented. 


The processor sets this bit to 1 when an 
enabled condition or event occurs and 
MMCROFcecg= 1. 


Freeze Counters in Supervisor State (FCS) 
This bit is a basic feature. 


0 The PMCs are incremented (if permitted 
by other MMCR bits). 

1 The PMCs are not 
MSRpp=0. 


Freeze Counters in Problem State (FCP) 


incremented if 


This bit is a basic feature. 


O The PMCs are incremented (if permitted 
by other MMCR bits). 

1 The PMCs are not 
MSRpp= 1. 


Freeze Counters while Mark = 1 (FCM1) 


This bit is a basic feature. 


incremented if 


O The PMCs are incremented (if permitted 
by other MMCR bits). 

1 The PMCs are not 
MSRpum= 1. 


Freeze Counters while Mark = 0 (FCMO) 


This bit is a basic feature. 


incremented if 


O The PMCs are incremented (if permitted 
by other MMCR bits). 


1 The PMCs are not incremented if 
MSRpum=0. 

Performance Monitor Exception Enable 

(PMXE) 


This bit is a basic feature. 


O Performance Monitor exceptions are disa- 
bled. 

1 Performance Monitor exceptions are 
enabled until a Performance Monitor 
exception occurs, at which time: 

E MMCROpyxe is set to O 


Programming Note 


Software can set this bit to O to prevent 
Performance Monitor interrupts. 


Software can set this bit to 1 and then poll 
the bit to determine whether an enabled 
condition or event has occurred. This is 
especially useful on an implementation 
that does not provide the Performance 
Monitor interrupt. 
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7:8 


10:15 


Freeze Counters on Enabled Condition or 
Event (FCECE) 


O The PMCs are incremented (if permitted 
by other MMCR bits). 

1 The PMCs are incremented (if permitted 
by other MMCR bits) until an enabled con- 
dition or event occurs when 
MMCROrrigger=0, at which time: 

m MMCROrç is set to 1 


If the enabled condition or event occurs when 
MMCROrrigger= 1, the FCECE bit is treated 
as if it were 0. 


Time Base Selector (TBSEL) 


This field selects the Time Base bit that can 
cause a Time Base transition event (the event 
occurs when the selected bit changes from 0 
to 1). 


00 Time Base bit 63 is selected. 
01 Time Base bit 55 is selected. 
10 Time Base bit 51 is selected. 
11 Time Base bit 47 is selected. 


Programming Note 


Time Base transition events can be used 
to collect information about processor 
activity, as revealed by event counts in 
PMCs and by addresses in SIAR and 
SDAR, at periodic intervals. 


In multiprocessor systems in which the 
Time Base registers are synchronized 
among the processors, Time Base transi- 
tion events can be used to correlate the 
Performance Monitor data obtained by the 
several processors. For this use, software 
must specify the same TBSEL value for all 
the processors in the system. 


Because the frequency of the Time Base 
is implementation-dependent, software 
should invoke a system service program 
to obtain the frequency before choosing a 
value for TBSEL. 


Time Base Event Enable (TBEE) 


O Time Base transition events are disabled. 
1 Time Base transition events are enabled. 


Threshold (THRESHOLD) 


This field contains a “threshold value”, which 
is a value such that only events that exceed 
the value are counted. The events to which a 
threshold value can apply are implementa- 
tion-dependent, as are the dimension of the 
threshold (e.g., duration in cycles) and the 
granularity with which the threshold value is 
interpreted. See the Book IV, PowerPC AS 
Implementation Features document for the 
implementation. 


16 


Programming Note 


By varying the threshold value, software 
can obtain a profile of the characteristics 
of the events subject to the threshold. 
For example, if PMC1 counts the number 
of cache misses for which the duration 
exceeds the threshold value, then soft- 
ware can obtain the distribution of cache 
miss durations for a given program by 
monitoring the program repeatedly using 
a different threshold value each time. 


Engineering Note 


A desirable use of THRESHOLD is to 
obtain a profile of the durations of cache 
misses. 


It is recommended that one or two bits in 
a HID register be provided that permit 
software to control the granularity with 
which the THRESHOLD value is inter- 
preted. For example, if one bit is pro- 
vided the value O could specify a 


granularity of 1 and the value 1 could 
specify a granularity of 32. 


PMC1 Condition Enable (PMC1CE) 


This bit controls whether counter negative 
conditions due to a negative value in PMC1 
are enabled. 


O Counter negative conditions for PMC1 are 
disabled. 

1 Counter negative conditions for PMC1 are 
enabled. 


PMCj Condition Enable (PMCjCE) 


This bit controls whether counter negative 
conditions due to a negative value in any 
PMCj (i.e., in any PMC except PMC1) are 
enabled. 


O Counter negative conditions for all PMCjs 
are disabled. 

1 Counter negative conditions for all PMCjs 
are enabled. 


Trigger (TRIGGER) 


O The PMCs are incremented (if permitted 
by other MMCR bits). 

1 PMC1 is incremented (if permitted by 
other MMCR bits). The PMCjs are not 
incremented until PMC1 is negative or an 
enabled condition or event occurs, at 
which time: 

m the PMCjs resume incrementing (if 
permitted by other MMCR bits) 
| MMCROrriGGER is set to O 


See the description of the FCECE bit, above, 
regarding the interaction between TRIGGER 
and FCECE. 
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19:25 


26:31 


Programming Note 
Uses of TRIGGER include the following. 


m Resume counting in the PMCjs when 
PMC1 becomes negative, without 
causing a Performance Monitor inter- 
rupt. Then freeze all PMCs (and 
optionally cause a Performance 
Monitor interrupt) when a PMCj 
becomes negative. The PMCjs then 
reflect the events that occurred 
between the time PMC1 became neg- 
ative and the time a PMCj becomes 
negative. This use requires the fol- 
lowing MMCRO bit settings. 


— TRIGGER=1 

— PMC1CE=0 

— PMCjCE=1 

— TBEE=0 

— FCECE=1 

— PMXE=1 (if a Performance 
Monitor interrupt is desired) 


m Resume counting in the PMCjs when 
PMC1 becomes negative, and cause a 
Performance Monitor interrupt without 
freezing any PMCs. The PMCjs then 
reflect the events that occurred 
between the time PMC1 became neg- 
ative and the time the interrupt 
handler reads them. This use 
requires the following MMCRO bit set- 
tings. 


— TRIGGER=1 
— PMC1CE=1 
— TBEE=0 
— FCECE=0 
— PMXE=1 


PMC1 Selector (PMC1SEL) 
This field is a basic feature. 


This field contains a code (one of at most 128 
values) that identifies the event to be counted 
in PMC1; see the Book IV, PowerPC AS Imple- 
mentation Features document for the imple- 
mentation. 


PMC2 Selector (PMC2SEL) 
This field is a basic feature. 


This field contains a code (one of at most 64 
values) that identifies the event to be counted 
in PMC2; see Book IV. 


E.2.3 Monitor Mode Control Register 1 


Monitor Mode Control Register 1 (MMCR1) is a 32-bit 
register. This register, along with MMCRO and 
MMCRA, controls the operation of the Performance 
Monitor. 


MMCR1 


0 31 


Figure 38. Monitor Mode Control Register 1 
MMCRI is a basic feature. 


Some bits of MMCRI are altered by the processor 
when various events occur, as described below. 


The bit definitions of MMCRI are as follows. MMCR1 
bits that are not implemented are treated as 
reserved. 


Bit(s) Description 


0:4 PMC3 Selector (PMC3SEL) 
5:9 PMC4 Selector (PMC4SEL) 
10:14  PMC5 Selector (PMC5SEL) 
15:19 PMC6 Selector (PMC6SEL) 
20:24 PMC7 Selector (PMC7SEL) 


Each of these fields contains a code (one of at 
most 32 values) that identifies the event to be 
counted in PMCs 3 through 7 respectively; see 
Book IV. 


25:28 PMC8 Selector (PMC8SEL) 


This field contains a code (one of at most 16 
values) that identifies the event to be counted 
in PMC8; see Book IV. 


29 Freeze Counters until IABR Match (FCUIABR) 


O The PMCs are incremented (if permitted 
by other MMCR bits). 

1 The PMCs are not incremented until a 
“monitored” IABR match occurs. An IABR 
match is said to be “monitored” if it 
occurs when PMC incrementing is per- 
mitted by MMCROç4 and MSRpR pum: 
When a monitored IABR match occurs: 

m the PMCs resume incrementing (if 
permitted by other MMCR bits) 
E MMCR1 EcuiaBR is set to 0 


The IABR (Instruction Address Breakpoint 
Register) is an implementation-specific SPR, 
and the definition of “IABR match” is imple- 
mentation-dependent; see the Book IV, 
PowerPC AS Implementation Features docu- 
ment for the implementation. 


30 PMC1 History Mode (PMC1HIST) 


This bit controls whether PMC1 is incre- 
mented in the normal way, described in 
Section E.2.1, or in “history mode”. In history 
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mode a PMC is shifted left by one bit each 
processor cycle, and the vacated low-order bit 
is set to 1 if the associated event occurred 
(one or more times) in that cycle and is set to 
0 otherwise. 


O PMC1 is incremented normally (if incre- 
menting is permitted by other MMCR bits). 

1 PMC1 is incremented in history mode (if 
incrementing is permitted by other MMCR 
bits). 


31 PMCj History Mode (PMCjHIST) 


This bit controls whether all PMCjs are incre- 
mented in the normal way, described in 
Section E.2.1, or in “history mode”, described 
under PMC1HIST above. 


O All PMCjs are incremented normally (if 
incrementing is permitted by other MMCR 
bits). 

1 All PMCjs are incremented in history 
mode (if incrementing is permitted by 
other MMCR bits). 


E.2.4 Monitor Mode Control Register A 


Monitor Mode Control Register A (MMCRA) is a 32-bit 
register. This register, along with MMCRO and 
MMCR1, controls the operation of the Performance 
Monitor. 


MMCRA 
0 31 


Figure 39. Monitor Mode Control Register A 


MMCRA is a basic feature. Within MMCRA, some of 
the bits and fields are basic features and some are 
extensions. The basic bits and fields are identified as 
such, below. 


The bit definitions of MMCRA are as follows. MMCRA 


bits that are not implemented are treated as 
reserved. 

Bit(s) Description 

0 Multithread Count Mode (MODE) 


O Global Mode: All PMCs count all threads 

(no thread active gating). 
Example: If MMCRO is programmed to 
have PMC1 count instructions executed, 
PMC1 will count instructions executed by 
both thread 0 and 1. 

1 Thread Mode: PMC1 - PMC4 count events 
tor thread 0. PMC5 - PMC8 count the 
same events for thread 1. 

Example: If MMCRO is programmed to 
have PMC1 count instructions executed, 
PMC1 will count instructions executed both 


8:14 


17:23 
24:27 
28:29 
30 


thread 0, and and PMC5 will count instruc- 
tion executed by thread 1. 


When MODE = 1, the PMC SPR addressing 
changes. 


m For thread 0, PMC1 - PMC4 (Performance 
Monitor Counter registers 1 - 4) are 
addressed using PMC1 PMC4 SPR 
addresses from Figure 35 on page 95. 
The results of mfspr or mtspr instructions 
that use a PMC5 - PMC8 SPR address are 
implementation-dependent. 

m For thread 1, PMC5 - PMC8 are addressed 
using PMC1 - PMC4 SPR addresses from 
Figure 35 on page 95. The results of 
mfspr or mtspr instructions that use a 
PMC5 - PMC8 SPR address are implemen- 
tation-dependent. 


Freeze Counters 1-4 (FC1-4) 


0 PMC1 - PMC4 are incremented (if per- 
mitted by other MMCR bits). 
1 PMC1 - PMC4 are not incremented. 


Freeze Counters 5-8 (FC5-8) 


0 PMC5 - PMC8 are incremented (if per- 
mitted by other MMCR bits). 
1 PMC5 - PMC8 are not incremented. 


Reserved 
Reserved for implementation-specific use 


External Performance Monitor 


(EPMX) 


Set to 1 if an External Performance Monitor 
exception is received. This bit can be set to O 
only by the mtspr instruction. Software 
should set this bit to 0 after handling the 
external event. 


Exception 


External Performance Monitor 
Enable (EPMXE) 


O External Performance Monitor exceptions 
are disabled. 

1 External Performance Monitor exceptions 
are enabled. 


Exception 


An external signal can be driven by other 
components in the system to signal an excep- 
tion when one or more of their counters has 
its high-order bit set to 1. 


Reserved 

Reserved for implementation-specific use 
Reserved 

Freeze Counters in Wait State (FCWAIT) 
This bit is a basic feature. 


O The PMCs are incremented (if permitted 
by other MMCR bits). 
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1 The PMCs are not incremented if 
CTRLg=0. Software is expected to set 
CTRL3;=0 when it is in a “wait state”, i.e, 
when there is no process ready to run. 


Only Branch Unit type of events do not incre- 
ment if CTRL3;=0. Other units continue to 
count. 


31 Reserved 


E.2.5 Sampled Instruction Address 
Register 


The Sampled Instruction Address Register (SIAR) is a 
64-bit register. It contains the address of the 
“sampled instruction” when a Performance Monitor 
exception occurs. 


SIAR 


0 63 


Figure 40. Sampled Instruction Address Register 


When a Performance Monitor exception occurs, SIAR 
is set to the effective address of an instruction that 
was executing, possibly out-of-order, at or around the 
time that the Performance Monitor exception 
occurred. This instruction is called the “sampled 
instruction”. 


The contents of SIAR may be altered by the processor 
if and only if MMCROpyye=1. Thus after the Perform- 
ance Monitor exception occurs, the contents of SIAR 
are not altered by the processor until software sets 
MMCROpmxg to 1. After software sets MMCROpyxe to 
1, the contents of SIAR are undefined until the next 
Performance Monitor exception occurs. 


See Section E.4 regarding the effects of the Trace 
facility on SIAR. 


Engineering Note 


If the Performance Monitor exception is caused by 
an enabled counter negative condition that can be 
associated with the execution of a specific instruc- 
tion, it is preferable to set SIAR to that 
instruction's address. 


E.2.6 Sampled Data Address 
Register 


The Sampled Data Address Register (SDAR) is a 
64-bit register. It contains the address of the 
“sampled data” when a Performance Monitor excep- 
tion occurs. 
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SDAR 


0 63 


Figure 41. Sampled Data Address Register 


When a Performance Monitor exception occurs, SDAR 
is set to the effective address of the storage operand 
of an instruction that was executing, possibly out-of- 
order, at or around the time that the Performance 
Monitor exception occurred. This storage operand is 
called the “sampled data”. The sampled data may 
be, but need not be, the storage operand (if any) of 
the “sampled instruction” (see Section E.2.5). If the 
Performance Monitor exception causes a Performance 
Monitor interrupt, SRR1 indicates whether the 
sampled data is in fact the storage operand of the 
sampled instruction (see Section E.3). 


The contents of SDAR may be altered by the 
processor if and only if MMCROpyyxe=1. Thus after 
the Performance Monitor exception occurs, the con- 
tents of SDAR are not altered by the processor until 
software sets MMCROpyye to 1. After software sets 
MMCROpuxe to 1, the contents of SDAR are undefined 
until the next Performance Monitor exception occurs. 


See Section E.4 regarding the effects of the Trace 
facility on SDAR. 


Engineering Note 


If the sampled instruction has a storage operand, 
it is preferable to set SDAR to that storage oper- 
and's address. 


E.3 Performance Monitor 
Interrupt 


The Performance Monitor interrupt is a system- 
caused interrupt (see Section 5.3, “Interrupt Classes” 
on page 52). It is masked by MSRggz in the same 
manner that External and Decrementer interrupts are. 


A Performance Monitor interrupt occurs when no 
higher priority exception exists, a Performance 
Monitor exception exists, and MSRee=1. The occur- 
rence of the interrupt cancels the exception (i.e., 
causes the exception to cease to exist). 


If multiple Performance Monitor exceptions occur 
before the first causes a Performance Monitor inter- 
rupt, the interrupt reflects the most recent Perform- 
ance Monitor exception and the preceding 
Performance Monitor exceptions are lost. 
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The following registers are set: 


SRRO Set to the effective address of the instruc- 
tion that the processor would have 
attempted to execute next if no interrupt 
conditions were present. 


SRR1 

33 Set to 1 if the contents of SIAR and SDAR 
are associated with the same instruction 
(i.e., if SDAR contains the effective address 
of the storage operand of the “sampled 
instruction”); otherwise set to 0 (including 
the case in which the “sampled 
instruction” has no storage operand). 

34:36 and 42:47 See the Book IV, PowerPC AS 
Implementation Features document for the 
implementation. 

Others Loaded from the MSR. 


Engineering Note 


SRR1 bits 34:36 and 42:47 can be used to 
provide information about the state of the 
processor at the time the “sampled instruction” 
was being executed or at the time the Perform- 
ance Monitor exception is generated. 


MSR See Figure 27 on page 54. 


SIAR Set to the effective address of the 
“sampled instruction” (see Section E.2.5). 


SDAR Set to the effective address of the 
“sampled data” (see Section E.2.6). 
address 


Execution resumes at effective 


0x0000_0000_0000_0F00. 


In general, statements about External and 
Decrementer interrupts elsewhere in this Book apply 
also to the Performance Monitor interrupt; for 
example, if a Performance Monitor exception is 
pending when an mtmsr or mtmsrd instruction is exe- 
cuted that changes MSRgg from O to 1, the Perform- 
ance Monitor interrupt will occur before the next 
instruction is executed (if no higher priority exception 
exists). 


The priority of the Performance Monitor interrupt is 
between that of the External interrupt and that of the 
Decrementer interrupt (see Section 5.7.2, “Ordered 
Exceptions” on page 64 and Section 5.8, “Interrupt 
Priorities” on page 65). 


E.4 Interaction with the Trace 
Facility 


If the Trace facility includes setting SIAR and SDAR 
(see Appendix F, “Example Trace Extensions 
(Optional)” on page 103), and tracing is active 
(MSRsp=1 or MSRge-1), the contents of SIAR and 
SDAR as used by the Performance Monitor facility are 
undefined and may change even when 
MMCROpyxe=9, and the contents of SRR133 when a 
Performance Monitor interrupt occurs are also unde- 
fined. 


Programming Note 


A potential combined use of the Trace and Per- 
formance Monitor facilities is to trace the control 
flow of a program and simultaneously count 
events for that program. 


E.5 Synchronization 
Requirements for Performance 
Monitor SPRs 


Any requirements for synchronizing the effect of 
loading Performance Monitor SPRs is implementa- 
tion-dependent. 
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Appendix F. Example Trace Extensions (Optional) 


This appendix provides an example of extensions that 
may be added to the Trace facility described in 
Section 5.5.13, “Trace Interrupt” on page 63. It is 
only an example; implementations may provide all, 
some, or none of the features described here, or may 
provide features that are similar to those described 
here but differ in detail. See the Book IV, PowerPC 
AS Implementation Features document for the imple- 
mentation. 


The extensions consist of the following features 
(described in detail below). 


m use of MSRse pp=0b11 to specify new causes of 
Trace interrupts 


m specification of how certain SRR1 bits are set 
when a Trace interrupt occurs 


m setting of SIAR and SDAR (see Appendix E) when 
a Trace interrupt occurs 


MSR BE = 0b11 


If MSRse ge=0b11, the processor generates a Trace 
exception under the conditions described in Section 
5.5.13 for MSRse ge=0b01, and also after successfully 
completing the execution of any instruction that would 
cause at least one of SRR1 bits 33:36, 42, and 44:46 to 
be set to 1 (see below) if the instruction were exe- 
cuted when MSRgg pe=0b10. 


This overrides the implicit statement in Section 5.5.13 
that the effects of MSRsg ge=0b11 are the same as 
those of MSRsg pe=0b10. 


SRR1 


When a Trace interrupt occurs, the SRR1 bits that are 
not loaded from the MSR are set as follows instead of 
as described in Section 5.5.13. 


33 Set to 1 if the traced instruction is icbi; oth- 
erwise set to 0. 

34 Set to 1 if the traced instruction is debt, 
dcbtst, dcbz, dcbst, or dcbf; otherwise set 
to O. 

35 Set to 1 if the traced instruction is a Load 


instruction or eciwx; may be set to 1 if the 
traced instruction is icbi, dcbt, dcbist, 
dcbst, or dcbf; otherwise set to 0. 


36 Set to 1 if the traced instruction is a Store 
instruction, dcbz, or ecowx; otherwise set 
to O. 

42 Set to 1 if the traced instruction is Iswx or 
stswx; otherwise set to 0. 

43 See the Book IV, PowerPC AS Implementa- 
tion Features document for the implemen- 
tation. 

44 Set to 1 if the traced instruction is a 
Branch instruction and the branch is taken; 
otherwise set to 0. 

45 Set to 1 if the traced instruction is eciwx or 
ecowx; otherwise set to 0. 

46 Set to 1 if the traced instruction is Iwarx, 
Idarx, stwcx., or stdcx.; otherwise set to 0. 

47 See the Book IV, PowerPC AS Implementa- 
tion Features document for the implemen- 
tation. 


Engineering Note 


The setting of bit 44 as specified above is not 
expected to be provided on implementations that 
fold branches. 


Bits 43 and 47 can be used to provide information 
about the state of the processor at the time the 
traced instruction was being executed. 


SIAR and SDAR 


If the optional Performance Monitor facility is imple- 
mented and includes SIAR and SDAR (see 
Appendix E, “Example Performance Monitor 
(Optional)” on page 93), the following additional reg- 
isters are set when a Trace interrupt occurs: 


SIAR Set to the effective address of the traced 
instruction. 
SDAR Set to the effective address of the storage 


operand (if any) of the traced instruction; 
otherwise undefined. 


If the state of the Performance Monitor is such that 
the Performance Monitor may be altering these regis- 
ters (i.e., if MMCROpyyxe=1), the contents of SIAR and 
SDAR as used by the Trace facility are undefined and 
may change even when no Trace interrupt occurs. 
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Engineering Note 


On an implementation for which the Performance 
Monitor permits the number of instructions com- 
pleted between successive Trace interrupts to be 
counted exactly, the setting of SIAR as described 
above is not needed. 


It is acceptable for SDAR not to be set as speci- 
fied above under certain conditions (e.g., for a 
Storage Access instruction that causes a Data 
Storage interrupt). 
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Appendix G. PowerPC AS Operating Environment Instruction 


Set 


Form inte Mode Priv.? | Page | Mnemonic 
Primary | Extend | Dep. 

X 31 83 P 19 | mfmsr 

XFX 31 339 O 18 | mfspr 

X 31 595 32 P 82 | mfsr 

X 31 659 32 P 82 | mfsrin 

X 31 146 P 78 | mtmsr 

X 31 178 P 19 | mtmsrd 

XFX 31 467 O 17 | mtspr 

X 31 210 32 P 81 | mtsr 

X 31 242 32 P 81 | mtsrin 

XL 19 18 P 12 | rfid 

SC 17 11 | sc 

X 31 498 P 44 | slbia 

X 31 434 P 43 | slbie 

X 31 915 P 46 | slbmfee 

X 31 851 P 46 | slbmfev 

X 31 402 P 45 | slbmte 

X 31 370 P 48 | tlbia 

X 31 306 64 H 47 | tlbie 

X 31 566 H 48 | tlbsync 


Instruction 


Move From Machine State Register 
Move From Special Purpose Register 
Move From Segment Register 

Move From Segment Register Indirect 
Move To Machine State Register 
Move To Machine State Register Doubleword 
Move To Special Purpose Register 
Move To Segment Register 

Move To Segment Register Indirect 
Return From Interrupt Doubleword 
System Call 

SLB Invalidate All 

SLB Invalidate Entry 

SLB Move From Entry ESID 

SLB Move From Entry VSID 

SLB Move To Entry 

TLB Invalidate All 

TLB Invalidate Entry 

TLB Synchronize 


1Key to Mode Dependency Column 


Except as described below and in the section entitled 
“Effective Address Calculation” in Book |, all 
instructions in the PowerPC AS Operating Environ- 
ment Architecture are independent of whether the 
processor is in 32-bit or 64-bit mode. 


32 The instruction must be executed only in 
32-bit mode. 

64 The instruction must be executed only in 
64-bit mode. 


2Key to Privilege Column 

P denotes a privileged instruction. 

O denotes an instruction that may be treated as privi- 
leged or nonprivileged (or hypervisor, for mtspr), 


depending on the SPR number. 


H denotes an instruction that can be executed only in 
hypervisor state. 
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Index 


a 


ACCR 35 
address 
effective address 21 
real 23, 26 
address compare 22, 56 
ACCR 35 
Address Compare Control Register 17, 18, 35 
Address Space Register 17, 18, 79 
address translation 37 
EA to VA 25 
esid to vsid 25 
overview 28 
PTE 
page table entry 32,37 
Reference bit 37 
RPN 
real page number 31 
VAtoRA 31 
VPN 
virtual page number 31 
32-bit mode 25 
addresses 
accessed by processor 27 
implicit accesses 27 
interrupt vectors 27 
with defined uses 27 
Alignment interrupt 59, 91 
ASR 79 
assembler language 
extended mnemonics 83 
mnemonics 83 
symbols 83 


BE 
See Machine State Register 
Branch Trace 63 
Bridge 79 
ASR 79 
Segment Registers 80 
SR 80 


Caching Inhibited 22 
Change bit 37 
CIA 
See Current Instruction Address 
context 
definition 1 
synchronization 3 
Control Register O 14,17, 18 
CTRL 
See Control Register 
Current Instruction Address 7,11 


o] 


DABR interrupt 76 
DAR 
See Data Address Register 
data access 23 
Data Address Breakpoint Register 17, 18, 76 
data address compare 56 
ACCR 35 
Data Address Register 13, 17, 18, 53, 56, 57, 60 
Data Segment interrupt 57 
Data Storage interrupt 56 
Data Storage Interrupt Status Register 14, 17, 18, 
56, 59, 60, 91 
Alignment interrupt 91 
dcbf instruction 56 
dcbst instruction 56 
dcbz instruction 35, 41, 56, 59, 91 
Decrementer 17, 18, 69 
Decrementer interrupt 19, 62, 78 
DR 
See Machine State Register 
DSISR 
See Data Storage Interrupt Status Register 


E] 


E (Enable bit) 75 
eciwx instruction 75, 56, 59, 60 
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ecowx instruction 75, 56, 59, 60 
EE 

See Machine State Register 
effective address 21, 28 

size 22 

translation 28 
eieio instruction 49 
emulation assist 2, 53 
exceptions 

address compare 22, 35, 56 

definition 1 

page fault 22, 34, 56 

protection 22 

segment fault 22 

storage 22 
execution synchronization 4 
External Access Register 75, 17, 18, 56 
External interrupt 19, 59, 78 


FF| 


FEO 

See Machine State Register 
FE1 

See Machine State Register 
Floating-Point Unavailable interrupt 62 
FP 

See Machine State Register 


+! 


hardware 
definition 2 
hashed page table 32 
size 33 
HTAB 
See hashed page table 
HTABORG 33 
HTABSIZE 33 
hypervisor 4 
page table 32 


N 


icbi instruction 56 
ILE 

See Machine State Register 
implicit branch 23 
imprecise interrupt 52 
in-order operations 23 
instruction fetch 23 

effective address 23 

implicit branch 23 
Instruction Segment interrupt 58 
Instruction Storage interrupt 58 
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instruction-caused interrupt 52 
instructions 


dcbf 56 
dcbst 56 
dcbz 35, 41, 56, 59, 91 
eciwx 75, 56, 59, 60 
ecowx 75, 56, 59, 60 
eieio 49 
icbi 56 
isync 4, 52, 53, 61 
Idarx 53, 56, 59, 60 
Imw 59,60 
lookaside buffer 41 
Iswi 60 
Iswx 60 
Iwa 60 
Iwarx 53, 56, 59, 60, 91 
Iwaux 60 
Iwz 91 
mfmsr 7, 19 
mfspr 18 
mfsr 82 
mfsrin 82 
mtmsr 4, 7, 65, 78 
mtmsrd 4,7, 19, 65 
mtspr 17 
mtsr 81 
mtsrin 81 
optional 

See optional instructions 
rfid 7, 12, 53, 63, 65 
sc 11, 62 
slbia 44 
slbie 43 
slbmfee 46 
slomfev 46 
slomte 45 
stdcx. 53, 56, 59, 60 
stmw 59 
storage control 41 
stw 91 
stwcx. 53, 56, 59, 60 
stwx 91 
sync 4, 37, 49, 52, 53, 61 
tibia 35, 48 
tlbie 35, 47, 48, 49 
tlbsync 48, 49 


interrupt 


Alignment 59, 91 
DABR 76 

Data Segment 57 
Data Storage 56 
Decrementer 19, 62, 78 
definition 2 

External 19, 59, 78 
Floating-Point Unavailable 62 
imprecise 52 
Instruction Segment 58 
Instruction Storage 58 
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interrupt (continued) Machine State Register (continued) 
instruction-caused 52 FP FP Available 9 
Machine Check 55 ILE Interrupt Little-Endian Mode 8 
new MSR 54 IR Instruction Relocate 9 
overview 51 LE Little-Endian Mode 9 
Performance Monitor 63 ME Machine Check Enable 9 
precise 52 PMM Performance Monitor Mark 9,94 
priorities 65 PR Problem State 8 
processing 53 RI Recoverable Interrupt 9, 19, 78 
Program 60 SE  Single-Step Trace Enable 9 
recoverable 53 SF Sixty Four Bit mode 8 
synchronization 51 Machine Status Save Restore Register 
System Call 62 See SRRO, SRR1 
System Reset 55 Machine Status Save Restore Register 0 7, 17, 18, 
system-caused 52 52, 53 
Trace 63 Machine Status Save Restore Register 1 7, 17, 18, 
vector 53, 54 53, 61 
IR ME 
See Machine State Register See Machine State Register 
isync instruction 4, 52, 53, 61 Memory Coherence Required 22 
mfmsr instruction 7, 19 
mfspr instruction 18 
[K | mfsr instruction 82 
mfsrin instruction 82 
K bits 40 mnemonics 
key, storage 40 extended 83 
MSR 
See Machine State Register 
mtmsr instruction 4, 7, 65, 78 


mtmsrd instruction 4, 7, 19, 65 
mtspr instruction 17 

mtsr instruction 81 

mtsrin instruction 81 


large page 29 
Idarx instruction 53, 56, 59, 60 
LE 

See Machine State Register 
Imw instruction 59, 60 


Logical Partition Identity Register 4 
Logical Partitioning 4 
lookaside buffer 41 Next Instruction Address 7, 11, 12 
lookaside buffers 71 NIA 
LPAR (see Logical Partitioning) 4 See Next Instruction Address 
LPES bit 4 
LPIDR 4 
Iswi instruction 60 [o] 
Iswx instruction 60 
Iwa instruction 60 opcode O 91 
Iwarx instruction 53, 56, 59, 60, 91 optional facilities 79 
Iwaux instruction 60 optional instructions 41, 75 
Iwz instruction 91 slbia 44 
slbie 43 
tlbia 48 
[m] tlbie 47 
tlbsync 48 
Machine Check interrupt 55 out-of-order operations 23 


Machine State Register 7, 11, 19, 52, 53, 54, 78 
BE Branch Trace Enable 9 


DR Data Relocate 9 [P| 

EE External Interrupt Enable 8, 19, 78 

FEO FP Exception Mode 9 page 

FE1 FP Exception Mode 9 size 22 
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page fault 22, 34, 56 
page size 

large page 29 
page table 

See also hashed page table 

search 34 

update 49 
page table entry 32,37 

Change bit 37 

PPbits 40 

Reference bit 37 

update 49 
partition 4 
Performance Monitor interrupt 63 
PMM 

See Machine State Register 
PPbits 40 
PR 

See Machine State Register 
precise interrupt 52 
priority of interrupts 65 
Processor ID Register 15, 18 
Processor Version Register 15, 18 
Program interrupt 60 
protection boundary 40, 60 
protection domain 40 
PTE 34 

See also page table entry 
PTEG 34 
PVR 

See Processor Version Register 


e! 


RC bits 37 
real address 26, 28 
Real Mode Caching Inhibited bit 4 
Real Mode Limit Register 4 
Real Mode Offset Register 4 
real page 
definition 1 
real page number 32 
recoverable interrupt 53 
reference and change recording 37 
Reference bit 37 
registers 
ACCR 
Address Compare Control Register 17, 18 
ASR 
Address Space Register 17, 18 
CTRL 
Control Register O 14,17, 18 
DABR 
Data Address Breakpoint Register 17, 18, 76 
DAR 
Data Address Register 13, 17, 18, 53, 56, 57, 60 
DEC 
Decrementer 17, 18, 69 


registers (continued) 
DSISR 
Data Storage Interrupt Status Register 14, 17, 
18, 56, 59, 60, 91 
EAR 
External Access Register 75, 17, 18, 56 
MSR 
Machine State Register 7, 11, 19, 52, 53, 54, 78 
optional 75 
PIR 
Processor ID Register 15, 18 
PVR 
Processor Version Register 15, 18 
SDR1 
Storage Description Register 1 
Segment Registers 71 
SPRGn 
software-use SPRs 14, 17, 18 
SPRs 71 
SRRO 
Machine Status Save Restore Register 0 7,17, 
18, 52, 53 
SRR1 
Machine Status Save Restore Register 1 7,17, 
18, 53, 61 
status and control 71 
TB 
Time Base 67 
TBL 
Time Base Lower 17, 67 
TBU 
Time Base Upper 17,67 
relocation 
data 23 
reserved field 2 
rfid instruction 7, 12, 53, 63, 65 
RI 
See Machine State Register 
RID (Resource ID) 75 
RMLR 4 
RMOR 4 


[s| 


sc instruction 11, 62 
SDR1 
See Storage Description Register 1 
SE 
See Machine State Register 
segment 
size 22 
type 22 
Segment Lookaside Buffer 
See SLB 
Segment Registers 71, 80 
Segment Table 
bridge 79 


17, 18, 33 
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sequential execution model System Call interrupt 62 
definition 2 System Reset interrupt 55 
SF system-caused interrupt 52 


See Machine State Register 


Single-Step Trace 63 
SLB 29,41 
entry 29 


slbia instruction 44 table update 49 

slbie instruction 43 Time Base 67 

slbmfee instruction 46 Time Base Lower 17, 67 

slbmfev instruction 46 Time Base Upper 17, 67 

slbmte instruction 45 TLB 35, 41 

software-use SPRs 14, 17, 18 tibia instruction 35, 48 

speculative operations 23 tibie instruction 35, 47, 48, 49 

SPRGn tlbsync instruction 48, 49 
See software use SPRs Trace interrupt 63 

SPRs 71 translation lookaside buffer 35 

SR 80 trap interrupt 

status and control registers 71 definition 2 


stdex. instruction 53, 56, 59, 60 


stmw instruction 59 
storage 
accessed by processor 27 virtual address 28, 31 


i 22 
aag generation 28 


Guarded 40 aa 22 b 32 
implicit accesses 27 virtual page numoer 


interrupt vectors 27 


K 40 
key 40 


N 34, 40 Write Through Required 22 
no-execute 34, 40 

ordering 22 

PR 40 

protection 40 32-bit mode 25 


translation disabled 40 
weak ordering 22 
with defined uses 27 
storage control 
instructions 41 
storage control bits 36 
Storage Description Register 1 17, 18, 33 
storage key 40 
storage model 22 
storage operations 
in-order 23 
out-of-order 23 
speculative 23 
storage protection 40 
stw instruction 91 
stwex. instruction 53, 56, 59, 60 
stwx instruction 91 
symbols 83 
sync instruction 4, 37, 49, 52, 53, 61 
synchronization 3, 49, 71 
context 3 
execution 4 
interrupts 51 
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